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'Tumor Necrosis Factor Receptors 5, 6a & 6(J 

Field of the Invention 

The present invention relates to novel human genes encoding 
pol> peptides u hich are members of the TNF receptor family. More 
specifically, isolated nucleic acid molecules are provided encoding human 
polypeptides named tumor necrosis factor receptor-5. -6a & -6(3 hereinatter 
sometimes referred to as "TNFR-5. -6a. & -6p" or generically as "TNFR 
polypeptides" \ TNFR polypeptides are also provided, as are vectors, host cells 
and recombinant me: hods for producing the same. The invention further relates 
to screening methods for identifying agonists and antagonists ot TNFR 
poivpepude acti\ lty. Also provided are diagnostic and therapeutic methods 
utilizing such compositions. 

Background of the Invention 

Many biological actions, for instai.ee. response to certain stimuli and 
natural biological processes, are controlled by factors, such as cytokines. Many 
cvtokme-. act through receptors by engaging the receptor and producing an intra- 
cellular response. 

For ex-mple. tumor necrosis factors (TNF) alpha and beta are cvtokmes 
which act through TNF receptors to regulate numerous biological processes, 
including protection against infection and induction of shock and inflammatory 
disease. The TNF molecules belong to the TNF-ligand" supenamtlv . and act 
together with their receptors or counter-hgands. the "TNF-recepior 
supcrfamiK . So far. nine members of the TNF ligand superfamily have been 
identified and ten members of the TNF-receptor supcrfamilv have been 
characterized. 

Among the ligaiids there are included TNF-u. Iv mp!v>:o\m U il.T-n. 
known as TM--(J». F T [5 1 found in complex heterotrimer I.T-r/2-|5 >. Fasl.. 
( i;4i»L. CD:Tl.. C'D.^nL. 4 -IBBF. ()\4U|_ and nerve growth factor t NC : F>. 
I he sujvK.umlv .»i I NF receptors includes the p55TNF receptor. pT VfNF* 
*iUcpi«n. i \| | U cpi'H lelaied protein. I AS antigen or AIU )■ I . C\Ui). CD1~. 



CD30. 4-IBB. OX40. low affinity p75 and NGF-rceeptor (Meager. A., 
Biologicals. 22:291.295 ( 1994)). 

Man\ members of the TNF-ligand supcrfamily are expressed by 
activated T-cei'U. implying that they are necessary for T-cell interactions with 
other cell types which underlie cell ontogeny and functions. (Meager, A., 
supra). 

Considerable insight into the essential functions of several members of 
the TNF receptor family has been gained from the identification and creation of 
mutants that abolish the expression of these proteins. For example, naturally 
occurring mutations in the FAS antigen and its ligand cause lymphoprolifcrative 
disease t Watar.abe-Fukunaga. R.. et al.. Nature 356:3 14 ( 1992)). perhaps 
reflecting a failure of programmed cell death. Mutations of the CD40 ligand 
cause an X-Iinked immunodeficiency state characterized by high levels of 
irmrunoclobulin M and low levels of immunoglobulin G in plasma, indicating 
fault v T-eell-depcndent B-cel! activation (Allen. R.C. et al.. Science 259:990 
( 1993 1). Targeted mutations of the low affinity nerve growth factor receptor 
cause a disorder characterized by faulty sensory innovation of peripheral 
structures (Lee. K.F. et al.. Cell 63:737 ( 1992)). 

TNF and LT-a arc capable of binding to two TNF receptors (the 55- 
and 75-kd TNF receptors). A large number of biological effects elicited by 
TNF and LT-a. acting through their receptors, include hemorrhagic necrosis of 
transplanted tumors, cytotoxicity, a role in endotoxic shock, inflammation, 
imniunoregulatir-n. proliferation and anti-viral responses, as well as protection 
aeumst the deleterious eliec^ of ionizing radiation. TNF and LT-a arc involved 
in the pathogenesis of a wide range of diseases, including endotoxic shock, 
cerebral malaria, minors, autoimmune disease. AIDS and graft-host rejection 
* Benslei . B. anJ Von Huffol. C Science 264:667-668 ( 1994)). Mutations in 
the p55 Receptor cause increased susceptibility to microbial infection. 

Moreover, an about HO amino acid domain near the C-ierminus of 
TNI R \ tp55i and Fas was reported as the "death domain." which is 
responsible lor Transducing signals for programmed cell death (Tartagliaet al.. 
Cell 74:K45 ( IWll. 

ApopteMv or programmed cell death, is a physiologic pnvess essential 
tn the riiniual development and homeostasis of multicellular organisms ( M. 
Sieller. Science 267. I445-144*J( |W5)». Derangements of apoptosis 
L«»nmbute to the pathogenesis *if several human di-.eases including cancer. 
nciii.Kk-jeneiativc disoulciv and acquued immune deficiency svndrome tC'.B. 
I h< .tups. mi. iuiu- 2o7. I JVi 1462 ' l'> u 5i». Recenily. much attention has 



focused on the signal transduction and biological function of two cell surface 
death receptors. Fas/APO-1 and TNFR-1 (J.L. Cleveland, J.N. Ihle.Cell 81, 
479-482 ( 1995): A. Frascr. G. Evan. Cell 85, 781-784 ( 1996); S. Nagata. P. 
CoNtcin. Science 267. 1449-56(1995)). Both are members of the TNF 
5 receptor family which also include TNFR-2, low affinity NGFR, CD40, and 

CD30. among others (C.A. Smith, et al.. Science 248, 1019-23 ( 1990); M. 
Tewan. V.M. Dixit, in Modular Texts in Molecular and Cell Biology M. 
Purton. Heldin. Carl, Ed. (Chapman and Hall, London. 1995). While family 
members are defined by the presence of cysteine-rich repeats in their 
in extracellular domains. Fas/APO-1 and TNFR-1 also share a region of 

intracellular homology, appropriately designated the "death domain", which is 
distantly related to the Dros'ophila suicide gene, reaper (P. Golstein. D. 
Marguet, V. Depraetere. Cell 81. 185-6 ( 1995); K. White et al.. Science 264, 
677-83 ( 1994)). This shared death domain suggests that both receptors interact 
1 5 with a related set of signal transducing molecules that, until recently, remained 

unidentified. Activation of Fas/APO-1 recruits the death domain-containing 
adapter molecule FADD/MORT1 (A.M. Chinnaiyan. K. O'Rourke. M. Tewari. 
V. M. Dixit. Cell 81. 505-12 (1995); M. P. Boldin. et al., J. Biol Chem 270. 
7795-8 ( 1995): F.C Kischkel. et al.. EM BO 14. 5579-5588 ( 1995)). which in 
:n turn binds and presumably activates FLICE/MACH I , a member of the 

ICE'CED-3 family of pro-apoptotic proteases (M. Muzio et al.. Cell 85. 817- 
827 i 1996): M P. Boldin. T.M. Goncharov. Y.V. Goltsev, D. Wallach. Cell 
85. 803-815 ( 1996)). While the central role of Fas/APO-1 is to trigger cell 
death. TNFR-1 can signal an array of diverse biological activities-many of 
25 which Mem from its ability to activate NF-kB (L.A. Tartaglia. D.V. Goeddel. 

Immunol Today 13. 151-3(1992)). Accordingly. TNFR- 1 recruits the 
multivalent adapter molecule TR ADD. which like FADD. also contains a death 
domain < H. FKu. J. Xiong. D.V. Goeddel. Cell 81. 495-504 ( 1995): H. Hsu. 
H.-B. Shu. M.-P. Pan. D.V. Goeddel. Cell 84. 299-308 ( 1996)). Through its 
associations * uh a number of signaling molecules including FADD. TRAF2. 
and RIP. TR ADD can signal both apoptosis and NF-kB activation ( H. H>u. H - 
B Shu. M.-P Pan. D.V. Goeddel. Cell 84. 299-308 ( 1996); H. H>u. J. 
Huang. H.-B Shu. V. Baichwal. D.V. Goeddel. Immunity 4. 387-396 

( 1 VW/i , J 
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Th«ffects of TNF fWil^gands ? ^fNF family rec^G^^^ricc} * 
and influence numerous functions, both normal and abnormal, in the biological 

processes of the mammalian system. There is a clear heed, therefore, for 
identification and characterization of such receptors and hg:inds that intfuence 
biological activity, both normally and in dtsease states. In particular, there is a 
need to isolate and characterize novel members of the TNF rcceptorfamily. 



Summary of the Invention 

The present invention provides isolated nucleic acid molecules 
comprising a polynucleotide encoding at least a portion of a TNFR-5, -6a or - 
10 6{5 polypeptide having the complete amino acid sequences shown in SEQ ID 

NOS:2. 4 and 6. respectively, or the complete amino acid sequence encoded by 
a cDNA clone deposited in a bacterial host as ATCC Deposit Number 97798, 
97810 and 97809, respectively. The nucleotide sequence determined by 
sequencing the deposited TNFR-5, -6a and -6p clones, which are shown m 
15 Figures 1, 2 and 3 (SEQ ID NOS:l. 3 and 5. respectively), contain open 

reading frames encoding complete polypeptides of 259, 300 and 170 amino 
acid residues, respectively, including an initiation codon encoding an 
N-terminal methionine at nucleotide positions 183-185, 25-27, and 73-75 in 
SLQ ID SOS: 1. 3. and 5 respectively. 
;u The TNFR proteins of the present invention share sequence homology 

with other TNF receptors. TNFR-5 shows the highest degree of sequence 
homology with the translation product for the human mRN A for nerve growth 
factor receptor (Figure 4) (SLQ ID NO:9), including multiple conserved 
cystiene rich domains Splice variants TNFR-6a and -60 show the highest 
2< degree of sequence homology with the translation products of the human 

mRN As tor INI K I and -11 (Figure 4) (SI Q II) NOS:7 and X. respectively) 
aUo including multiple conserved cysteine rich domains. 

The TNFR-5, -6u and -f>ti polypeptides have predicted leader 
sequences <>t 2<v 30 and 30 ammo acids, respectively, and the amino acid 
sequence ot ihe predicted mature TNFR-5, -ftu and -f>P polypeptides are also 




300 (SEQ ID NO:4). and 31-170 (SEQ ID NO:6), respectively. 

Thus, one aspect of the invention provides an isolated nucleic acid 
molecule comprising a polynucleotide having a nucleotide sequence selected 
from the group consisting of: (a) a nucleotide sequence encoding a TNFR 
polypeptide having the complete amino acid sequence in SEQ ID NO:2. 4 or 6. 
or as encoded by the cDNA clone contained in ATCC Deposit No. 97798, 
9*7810 or 97809; (b) a nucleotide sequence encoding a mature TNFR 
polypeptide having the amino acid sequence at positions 27-259 in SEQ ID 
NO:2. 3 1 -300 in SEQ ID NO:4. or 3 1- 1 70 in SEQ ID NO:6, or as encoded by 
the cDNA clone contained in ATCC Deposit No. 97798, 97810 or 97809; (c) 
a nucleotide sequence encoding a soluble extracellular domain of a TNFR 
polypeptide having the amino acid sequence at positions 27 to 240 in SEQ ID 
NO:2. 31 to 283 in SEQ ID NO:4, or 31 to 166 in SEQ ID NO:6. or as 
encoded by the cDNA cicne contained in the ATCC Deposit No. 97798. 
97810 or 97X09; and (d) a nucleotide sequence complementary to any of * u .c 
nucleotide sequences in (a), (b). or (c) above. 

Further embodiments of the invention include isolated nucleic acid 
molecules that comprise a polynucleotide having a nucleotide sequence at least 
90° o identical, and more preferably at least 95%. 9f>%. 97%. 98% or 99% 
identical, to any of the nucleotide sequences in (a), (b). (c) and (d) above, or a 
polynucleotide which hybridizes under stringent hybridization conditions to a 
polynucleotide in (a). <b). to or (d). aho* e. This polynucleotide which 
hyhndi/cs does not hybridize under stringent hybridization conditions to a 
polynucleotide having a nucleotide sequence consisting of only A residues or 
ot 1 residues An additional nucleic acid embodiment of the invention 

rri„:c- to an ixoiated iuilIck acid molecule comprising a poKnucleotide which 
criLfde- ihe amino acid sequence of an epttope-bearmg pomou of a INrR 
p'.li .pcpiulc ha\ m;j an amino a ud scqiiciue in < O. lb) or (O. abo\e 



The present invention also relates to recombinant vectors, which 
include the isolated nucleic acid molecules of the present invention, and to host 
cells containing the recombinant vectors, as well as to methods of making such 
vectors and host cells and for using them for production of TNFR 
polypeptides or peptides by recombinant techniques. 

The invention further provides an isolated TNFR polypeptide 
comprising an amino acid sequence selected from the group consisting of: (a) 
the ammo acid sequence of a full-length TNFR polypeptide having the 
complete amino acid sequence shown in SEQ ID NO:2. 4 or 6 or as encoded 
by the cDNA clone contained in ATCC Deposit No. 97798. 97810 or 9^809; 
<b> the amino acid sequence of a mature TNFR polypeptide having the amino 
acid sequence at positions 27-259 in SEQ ID NO:2. 31-300 in SEQ ID NO:4. 
or 31-1 "0 in SEQ ID NO:6. or as encoded by the cDNA clone contained in 
ATCC Deposit No. 97798. 97810 or 97809; or (c) the amino acid sequence of 
a soluble extracellular domain of a TNFR polypeptide having the amino acid 
sequence at pos.tions 27 to 240 in SEQ ID NO- 3 1 to 283 in SEQ ID NO:4. 
or 3 1 to 166 m SEQ ID NO:6. or as encoded by the cDNA clone contained tn 
ATCC Deposit No 9^98. 9^810 or 9"'809. 

The polypeptides of the present invention also include polypeptides 
hav ing an amino acid sequence at least 80% identical, more preferably at least 
v.)".. identical, and still more preferably 95V 96%. 97%. 98% or 99% identical 
to those described in (a), (b) oriel above, as well as polypeptides having an 
amino acid sequence w ith at least 90% similarity , and more preferably at least 
95".. similarity to iIionl above 

An additional embodiment of this aspect of the invention relates to a 
peptide or polypeptide *hich comprises the amino acid sequence ot an 
Lpit,;vK\>rmj pinion ot a IM K pol> peptide having an amino acid 
.eq;.e:^e described in <ai. «bi oriel, above Peptides or polypeptides hav mg 
:ui -an. mo a^td sequence ol an epitope-bcanng portion of a I NI R polypeptide 
i.t the m,e!iti"ii nulude portions ,.t such polypeptides with at least m\ or 




seven, preferably at least nine, and more preferably at least about 30 amino 
acids to about 50 amino acids, although cpitopc-bcaring polypeptides of any 
lencth up to and including the entire amino acid sequence of a polypeptide of 
the invention described above also are included in the invention. 

In another embodiment, the invention provides an isolated antibody 
•hat binds specifically to a TNFR polypeptide having an amino acid sequence 
described in (a), (b) or (c) above. The invention further provides methods for 
isolating antibodies that bind specifically to a TNFR polypeptide having an 
amino acid sequence as described herein. Such antibodies are useful 
diagnostically or therapeutically as described below. 

Tumor Necrosis Factor (TNF) family ligands arc knou.i to be among 
the most pleiorropic cytokines, inducing a large number of cellular responses, 
including cytotoxicity, anti-viral activity, immunoregulatory activities, and the 
transcriptional regulation of several genes. The invention also provides for 
pharmaceutical compositions comprising TNFR polypeptides, particularly 
human TNFR polypeptides, which may be employed, for instance, to treat 
infectious disease including HIV infection, endotoxic shock, cancer, 
autoimmune diseases, graft vs. host disease, acute graft rejection, chronic graft 
rejection, neurodegenerative disorders, myelodysplastic syndromes, ischemic 
injur> . toxin-induced I' er disease, septic shock, cachexia and anorexia. 
Method- of treating individuals in need of TNFR polypeptides are also 
pr» a ided 

The invention further provides compositions comprising a TNFR 
polynucleotide or a TNFR polypeptide for administration to cells in vitro, to 
cells, i \i\ t , and to cells m ww*. or to a multicellular organism. In certain 
partiwul.nl> prctcned embodiments ot tln> aspect of the invention, the 
v »n;p- --in* i« impure a 1 M K pol\ nucleotide for expression ot a I SI R 
;vl;. peptic m a host organism tor treaiment of disease. Particularly preferred 
in ti.i* rce.nd h c\pic--i«m in a human patient tor ticaiinenl of a dxslunctum 
as.oi:atcd Attn .men. ml endogenous activity tit" a I NI R pol\ peptide. 



In another aspect, a screening assay for agonists and antagonists is 
provided which involves determining the effect a candidate compound has on 
TNFR polypeptide binding to a TNF-family ligand. In particular, the method 
involves contacting the TNF-family ligand with a TNFR polypeptide and a 
candidate compound and determining whether TNFR polypeptide binding to 
the TNF-family ligand is increased or decreased due to the presence of the 
candidate compound. In this assay, an increase in binding of a TNFR 
polypeptide over the standard binding indicates that the candidate compound 
is an agonist of TNFR polypeptide binding activity and a decrease in TNFR 
polypeptide binding compared to the standard indicates that the compound is 
an antagonist of TNFR polypeptide binding activity. 

It has bc.en discovered that TNFR-5 is expressed not only in prostate 
tissue but also in endothelial ceils, stimulated monocytes and kerotinocytcs. 
TNFR-*a and -6(3 arc expressed endothelial eel is. keratinocytes, normal 
prostate and prostate rumor tissue. For a number of disorders of theses 
tissues or cell, particularly of the immune system significantly higher or lower 
levels of TNFR gene expression may be detected in certain tissues (e.g.. 
cancerous tissues) or bodih fluids (e.g.. serum, plasma, unne. synovial tluid or 
spnu! tluidi taken from an mdi\ idual having such a disorder, relative to a 
"standard" TNFR gene expression level, i.e.. the TNFR expression level in 
healthv :i»ue from an individual not having the immune system disorder. 
Thus, the invention provides a diagnostic method useful during diagnosis of 
such a disorder, uhich involves (a) a»aving TNFR gene expression level in 
cells or bod> fluid of an individual, (hi comparing the TNFR gene expression 
ie.el va ith a standard INI K gene expression level, wherebv an increase or 
d-.vrca^e in the a slaved INI K gene expression level compared to the standard 
expression level is mduauv e o! disorder in the immune sxstem. 

\"h .i-.ldm. .i.jvvi ot (he invention is iclated to a method foi treating 
.in i.-idi i in need .»t .in iiu leased level of I MR po!\ peptide activity in the 
!».»!;. c iti[<i iMn : j .idiinnistermg to such an individual a c imposition comprising 
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invention or an agonist thereof. : 

A still further aspect ofethc invention is related to a method for treating 
an individual in need of a decreased level of TNFR polypeptide activity in >hc 
body comprising, administcringto such an individual a composition 
comprising a therapeutically effective amount of a TNFR antagonist. 
Preferred antagonists for use in the present invention are TNFR-specific 
antibodies. 

Brief Description of the Figures 

Figure 1 shows the nucleotide sequence (SEQ ID NO: 1 ) and deduced 
amino acid sequence (SEQ ID NO:2) of TNFR-5, 

Figure 2 shows the nucleot-de sequence (SEQ ID NO:3) and deduced 
amino acid sequence (SEQ ID NO:4) of TNFR-6a. 

Figure 3 shows the nucleotide sequence (SEQ ID NO:5) and deduced 
ammo acid sequence (SEQ ID NO:6) of TNFR-6f}. 

Figure 4 shows an alignment created by the Clustal method using the 
Megahne program in the DNAstar suite comparing the atr* no acid sequences 
of TNFR-5 ("TNFR-like"). TNFR -6a ("TNFR-fia"). and 7NFR-6P ("TNFR- 
hb"i with other TNF receptors, as follows: TNFR 1 (SEQ ID SO:7); TNFR2 
(SFQ ID NO>>: NCr R (SEQ ID SO 9): LTbR (SEQ ID NO: MM. FAS 
(SI Q ID NO ID: CD2"' (SEQ ID NO 1 2 ): CD30 ( SEQ ID NO I 3); CD40 
iSI QIDNO 14k 4-lBB(SI QlDNO 15»: OX40 (SEQ ID NO: 1 ft); VC22 
IS! Q ID NO r». and ( RMB I SliQ ID NO IX). 

higurcN 5. * and r show separate :nal\ses of the TNFR-5. ftf/ and h\S 
amino acid sequences, respect i\ cly. Alpha, beta, turn and coil regions; 
lr.d:<';v.iiiiil\ and h> drophi bicitv . jmphipathic regions. flexible regions. 
aii!!.--::u tink v and siirfaLC prt ►h.thil ii> are shown In the "Antigenic Index - 
l.i ii (i .ii \\ graphs, the militate location of the Li JiK au'ieeirc regions ol 



the proteins, i.e.. regions from which epitopc-bcaring peptides of the invention 

may he obtained. 

Figure S shows the nucleotide sequence of fragments related to the 
TNFR genes of the present invention, including: HPRCB54R (SEQ ID 
5 NO:!9). H3JAU5 -RA (SEQ ID NO20). HELBP70R (SEQ ID NO:21 ), and 

HL SCB54R (SEQ ID NO:22) all of which are related to SEQ ID NO: 1 ; and 
HELDI06R (SEQ ID NO 23) and HCECW38R (SEQ ID NO:24) both of 
which are related to SEQ ID NCS:3 and 5. 

Detailed Description 

in The present invention provides isolated nucleic acid molecules 

comprising a polynucleot.de encoding a TNFR-5. -6a or -6p polypeptide, 
genencally "TNFR polypcptidc(sr having the amino acid sequence shown in 
SEQ ID NOS 2. 4 and 6. respectively, which were determined by sequencing 
cloned cDNAs. The nucleotide sequences shown in Figures L 2 and 3 (SEQ 
i< ID NOS 1. 3 and 5) were obtained by sequencing the HPRCB54. HPHAE52 

and HTPCIIS4 clones, which were deposited on November 20. i996, 
November 22. 1996. and November 22, 1996, respectively, at the Amencan 
Typo Culture Collection. 12301 Park Lawn Drive. Rockvtlle. Maryland 
20852. and given accession numbers ATCC 97798. 97810 and 97S09. 
;., respectively The deposited clones are contained in the pBlucscript SK(-) 

pla>miJ f Stratagene. La Jolte. CA). 

The TNFR-5 protein of the present invention has an amino acid 
sequence which is 21 .7% identical to and shares multiple conserved cysteine 
rich domains with the translation product of the human nerve growth factor 
: , ihNCil ) inRNA (SI.Q ID NO: 1 *) as illustrated in Figure 4. hNCiF is thought to 

plj. .in important role in the development, survival, apoptosts and function of 
neuron* < Lee. Y K et al.. Cell 69:737) and lymphocytes (Torcia. M. et al.. Cell 
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The TNFR-6ct and -6p proteins of the present invention arc splice 
variants which share an identical nucleotide and amino acid sequence over the 
N-terminal 142 residues of the respective proteins. The amino acid sequences 
of these proteins arc about 23% similar to and share multiple conserv ed 
cysteine rich domains with the translation product of the human TNFR-2 
mRNA (Figure 4) (SEQ ID NO:8). Importantly, these proteins share 
substantial sequence similarity over their extracellular domains including four 
repeated cysteine rich motifs with significant intcrsubunit homology. TNFR-2 
is thought to exclusively mediate human T-cell proliferation by TNF (PCT 
WO 94 09137). 

Nucleic Acid Molecules 

Unless otherwise indicated, all nucleotide sequences determined by 
sequencing a DNA molecule herein were determined using an automated DN A 
sequencer (such as the Model 373 from Applied Biosystems, Inc., Foster 
City. CA), and all amino acid sequences of polypeptides encoded by DNA 
molecules determined herein were predicted by translation of a DNA sequence 
determined as above. Therefore, as is known in the art for any DNA sequence 
determined by this automated approach, any nucleotide sequence determined 
herein may contain some errors Nucleotide sequences determined by 
automation are typically at least about 90% identical, more typically at least 
about 95% to at least about 99.9% identical to the actual nucleotide sequence 
of the sequenced DNA molecule. The actual sequence can be more precisely 
determined by other approaches including manual DNA sequencing methods 
well known in the art. As is also known in the art, a single insertion or 
deletion in a detennined nucleotide sequence compared to the actual sequence 
will cause a frame shift in translation of the nucleotide sequence such that the 
predicted amino acid sequence encoded by a determined nucleotide sequence 
will be completely different from the amino acid sequence actually encoded by 



the sequenced DNA molecule, beginning at the point of such an insertion or 
deletion. 

By "nucleotide sequence" of a nucleic acid molecule or polynucleotide 
is intended, for a DNA molecule or polynucleotide, a sequence of 
dcoxyribonuclcotidcs, and for an RNA molecule or polynucleotide, the 
corresponding sequence of ribonucleotides (A, G, C and U), where each 
thymidine dcox> ribonucleotide (T) in the specified deoxyribonucleotidc 
sequence is replaced by the ribonucleotide uridine (U). 

Using the information provided herein, such as the nucleotide 
sequences in Figures 1, 2 and 3 (SEQ ID NOS: I, 3 and 5), a nucleic acid 
mol-cule of the present invention encoding a TNFR polypeptide may be 
obtained using standard cloning and screening procedures, such as those for 
cloning cDNAs using mRNA as starting material. Illustrative of the invention, 
the TNFR-5 nucleic acid molecule described in Figure 1 was discovered in a 
cDNA library derived from prostate tissue. Additional clones of the same 
gene were also identified in cDNA libraries from the following tissues: 
endothelial cells, stimulated monocytes, and kerotinocytes. TNFR-6ct and -6p 
clones (Figures 2 and 3. respectively) were identified in cDNA libraries from 
the following tissues: endothelial cells, keratinocytes. normal prostate tissue, 
and pros»ate tunior tissue. 

The determined nucleotide sequences of the TNFR cDNAs of Figures 
1 . 2 and 3 (SEQ ID NOS: 1 , 3 and 5) contain open reading frames encoding 
proteins of 259. 300 and 170 amino acid residues, with an initiation codon at 
nucleotide positions 183-185. 25-27, 73-75 of the nucleotide sequences in 
Figures 1. 2 and 3 (SEQ ID NOS:!, 3 and 5), respectively. 

The open reading frames of the TNFR -6a and -6(3 genes share 
sequence homology with the translation product of the human mRNA for 
TNFR -2. including the soluble extracellular domain of about residues 3 i -2S3 
of SEQ ID NO:4 and 31-166 of SEQ ID NO:6, respectively. The open reading 
frame of the TNFR-5 gene shares sequence homology with the translation 
product of the human mRNA for NGFR, including the following conserved 



domains: (a) a soluble extracellular domain of about 214 amino acids (residues 
27-240 of SEQ ID NO:2); and (b) a transmembrane domain of about 19 amino 
acids (residues 241-259 of SEQ ID NO:2). 

As one of ordinary skill would appreciate, due to the possibilities of 
sequencing errors discussed above, the actual complete TNFR polypeptides 
encoded by the deposited cDNAs, which comprise about 259, 300, and 170 
amino acids, may be somewhat longer or shorter. More generally, the actual 
open reading frames may be anywhere in the range of ±20 amino acids, more 
likely in the range of ±10 amino acids, of that predicted from the first 
methionine codon from the N-terminus shown in Figures 1, 2 and 3 (SEQ ID 
NOS: 1. 3 and 5), which is in- frame with the translated sequences shown in 
each respective figure. It will further be appreciated that, depending on the 
analytical criteria used for identifying various functional domains, the exact 
"address" of the extracellular and transmembrane domain(s) of the TNFR 
polypeptides may differ slightly from the predicted positions above. For 
example, the exact location of the extracellular domain in SEQ ID NO:2 may 
vary slightly (e.g., the address may "shift" by about 1 to about 20 residues, 
more likely about 1 to .tbout 5 residues) depending on the criteria used to 
define the domain. In this case, the beginning of the transmembrane domain 
and the end of the extracellular domain were predicted on the basis of the 
identification of the hydrophobic amino acid sequence in the above indicated 
positions, as shown in Figure 5. In any event, as discussed further below, the 
invention further provides polypeptides having various residues deleted from 
the N-terminus of the comp'ete polypeptide, including polypeptides lacking 
one or more amino acids from the N-terminus of the extracellular domain 
described herein, which constitute soluble forms of the extracellular domains of 
the TNFR -5. -6a & -6[3 proteins. 



Leader and Mature Sequences 

The amino acid sequences of the complete TNFR proteins include a 
leader sequence and a mature protein, as shown in SEQ ID NOS.2, 4 and 6. 
More in particular, the present invention provides nucleic acid molecules 
encod.ng mature forms of the TNFR proteins. Thus, according to the signal 
hypothesis, once export of the growing protein chain across the rough 
endoplasmic reticulum has been initiated, proteins secreted by mammalian cells 
have a signal or secretory leader sequence which is cleaved from the complete 
polypeptide to produce a secreted "mature" form of the protein. Most 
mammalian cells and even insect cells cleave secreted proteins with the same 
specificity. However, in some cases, cleavage of a secreted protein is not 
entirely uniform, which results in two or more mature species of the protein. 
Further, it has long been known that the cleavage specificity of a secreted 
protein is ultimately determined by the primary structure of the complete 
protein, that is, it is inherent in the amino acid sequence of the polypeptide. 
Therefore, the present invention provides a nucleotide sequence encoding a 
mature TNFR polypeptide having the amino acid sequence encoded by a 
cDNA clone contained in a host identified as ATCC Deposit No. 97798. 
978 1 0 or 97809. By the ''mature TNFR polypeptides having the amino 3cid 
sequence encoded by a cDNA clone in ATCC Deposit No. 97798, 97810. or 
97X09" is meant the mature form(s) of the protein produced by expression in a 
mammalian cell (e.g., COS cells, as described below) of the complete open 
reading frame encoded by the human DNA sequence of the clone contained in 
the vector in the deposited host. 

In addition, methods lor predicting whether a protein has a secretory 
leader as well as the cleavage point for that leader sequence are available. For 
instance, trv. i::L"hod of McGeoch ( Virus Res. 3:271 -286 ( 1985)) uses the 
information from a short N-tcrminal charged region and a subsequent 
uncharged region of the complete (unclcaved) protein. The method of von 
I Icinje iStu lcu At ids Res /4:46.S3-4690 (1986)) uses the information from :he 



residues surrounding the cleavage site, typically residues -13 to +2 where +1 
indicates the amino terminus of the mature protein. The accuracy of predicting 
the cleavage points of known mammalian secretory proteins for each of these 
methods is in the range of 75-80% (von Hcinjc, supra). However, the two 
methods do not always produce the same predicted cleavage point(s) for a 
given protein. 

In the present case, the deduced amino acid sequence of the complete 
TNFR polypeptides were analyzed by a computer program "PSORT\ 
available from Dr. Kenia Nakai of the Institute for Chemical Research, Kyoto 
University (sec K. Nakai and M. Kanehisa, Genomics 74:897-91 1 (1992)), 
which is an expert system for predicting the cellular location of a protein based 
on the amino acid sequence. As part of this computational prediction of 
localization, the methods of McGcoch and von Heinje are incorporated. The 
analysis of the TNFR amino acid sequences by this program provided the 
following results: TNFR-5, -6a & -60 encode mature polypeptides having the 
amino acid sequences of residues 27-259, 31-300 and 31-170 of SEQ ID 
NOS:2. 4 and 6, respectively. 

As indicated, nucleic acid molecules of the present invention may be in 
the form of RNA, such as mRNA, or in the form of DNA, including, for 
instance. cDNA and genomic DNA obtained by cloning or produced 
synthetically. The DNA may be double-stranded or single-stranded. 
Single-stranded DNA or RNA ma> be the coding strand, also known as the 
sen>e strand, or it may be the non-coding strand, also referred to as the 
anii-sen.se strand. 

By "isolated" nucleic acid molccule(s) is intended a nucleic acid 
molecule. DNA or RNA. which has been removed from its native environment 
l or example, recombinant DNA molecules contained in a vector arc considered 
isolated for the purposes of the present invention. Further examples ot 
isolated DNA molecules include recombinant DNA molecules maintained in 
heterologous host cells or purified (partially or substantially) DNA molecules 
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in solution. Isolated RNA molecules include in vivo or in vitro RNA 
transcripts of the DNA molecules of the present invention. Isolated nucleic 
acid molecules according to the present invention further include such 
molecules produced synthetically. 

5 Isolated nucleic acid molecules of the present invention include DNA 

molecules composing an open reading frame (ORF) with an initiation codon at 
position* 183-185, 25-27 73-75 of the nucleotide sequences shown in SEQ ID 
NOS:K 3 and 5 respectively. 

Also included are DNA molecules comprising the coding sequence for 

10 . the predicted mature TNFR polypeptides shown at positions 27-259, 3 1-300, 

and 31-170 of SEQ ID NOS:2, 4 and 6, respectively. 

In addition, isolated nucleic acid molecules of the invention include 
DNA molecules which comprise a sequence substantially different from those 
described above but which, due to the degeneracy of the genetic code, still 

i <; encode a TNFR protein. Of course, the genetic code and species-specific 

codon preferences are well known in the art. Thus, it would be routine for one 
skilled in the art to generate the degenerate variants described above, for 
instance, to optimize codon expression for a particular host (e.g., change 
codon* in the human mRNA to those preferred by a bacterial host such as £. 

;u colt). 

In another aspect, the invention provides isolated nucleic acid 
molecules encoding a TNTR polypeptide having an amino acid sequence 
encoded by the cDNA clone contained in the plasmid deposited as ATCC 
Deposit No 9779X. 97S 1 0, or 97R09 Preferably, this nucleic acid molecule 
2f will encode the mature polypeptide encoded by the above-described deposited 

cDNA clone. 

The invention further provides an isolated nucleic acid molecule having 
the nucleotide ^sequence shown in I'igure 1 , 2 or 3 (Sl*Q ID NO: I, 3 or 5) or 
the nucleotide sequence of the TNI R cDNAs contained in the above-described 
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deposited clones, or a nucleic acid molecule having a sequence complementary 
t one of the above sequences. Such isolated molecules, particularly ON A 
molecules, are useful as probes for gene mapping, by in situ hybridization with 
chromosomes, and for detecting expression of the TNFR genes in human 
tissue, for instance, by Northern blot analysis. 

The present invention is further directed to nucleic acid molecules 
encoding portions of the nucleotide sequences described herein as well as to 
fragments of the isolated nucleic acid molecules described herein. In particular, 
the invention provides polynucleotides having a nucleotide sequence 
representing the portion of SEQ ID NO:l, 3 or 5 which consist of positions 
183-959, 25-924 and 73-582 of SEQ ID NOS:I, 3 and 5, respectively. Also 
contemplated are polynucleotides encoding TNFR polypeptides which lack an 
amino terminal methionine such polynucleotides having a nucleotide sequence 
representing the portion of SEQ ID NOS:l, 3 and 5 which consist of positions 
1 86-959. 28-924. and 76-582 of SEQ ID NOS: 1 .3 and 5. respectively. 
Polypeptides encoded by such polynucleotides arc also provided, such 
polypeptides comprising an amino acid sequence at positions 2-259, 2-30O, 
and 2-1 "0 of SEQ ID NOS:2, 4, and 6, respectively. 

In addition, the invention provides nucleic acid molecules having 
nucleotide sequences related to extensive portions of SEQ ID NOS:l, 3 and 5 
which have been determined from the following related cDNA clones: 
HhLBP'OR (SEQ ID NO:2l ), HPRCB54R (SEQ ID NO:19). HSJAU57RA 
(SEQ ID NO 20) and HLSCB54R (SEQ ID NO:22) arc related to SEQ ID 
NO 1. IIHLDI06R (SEQ ID NO:23)and IKEOW3XR (SEQ ID NO:24)are 
related to SEQ ID NOS:3 and 5. The nucleotide sequences of each of these 
climes in slum n in Figure X 

More- generally, by a fragment of an isolated nucleic acid molecule 
having the nucleotide sequence of the deposited cDNA or the nucleotide 
sequence slimw) in Figures 1 . 2 or (SEQ ID NOS I. 3 or 5) is intended 
fragments a» k-ast about I 5 nt. and more preferably at least about 20 nt, still 




more preferably at least about 30 nt, and even more preferably, at least about 
40 nt in length which arc useful as diagnostic probes and primers as discussed 
herein. Of course, larger fragments 50-300 nt in length are also useful 
according to the present invention as are fragments corresponding to most, if 

5 not all, of the nucleotide sequence of the deposited cDNAs or as shown in 

Figures 1, 2 and 3 (SEQ ID NOS: 1 , 3 and 5). By a fragment at least 20 nt in 
length, for example, is intended fragments which include 20 or more contiguous 
bases from the nucleotide sequence of a deposited cDN A or the nucleotide 
sequence as shown in Figures 1. 2 and 3 (SLQ ID NOS:l, 3 and 5). Preferred 

io nucleic acid fragments of the present invention include nucleic acid molecules 

encoding epitope-bearing portions of the TNFR polypeptides as identified in 
Figures 5. 6 and 7 and described in more detail below. 

In another aspect, the invention provides an isolated nucleic acid 
molecule compnsing a polynucleotide which hybridizes under stringent 

1 5 hybridization conditions to a portion of the polynucleotide in a nucleic acid 

molecule of the invention described above, for instance, a cDNA clone 
contained in ATCC Deposit No. 97798, 97810 or 97809. By "stringent 
hybridization conditions" is intended overnight incubation at 42° C in a 
solution comprising: 50" o formamidc. 5x SSC (150 m\! NaCI. 15 m\1 

:! . tnsodium curate >. 50 mM sodium phosphate (pH 7.6). 5x Dcnhardl's solution. 

10*11 dextran sulfate, and 20 pg ml denatured, sheared salmon sperm DNA, 
ttiiiowcd by washing the filters in 0.1 x SSC at about 65' C. 

By a polynucleotide which hybridizes to a "portion" of a 
polynucleotide is intended a polynucleotide (cither DNA or RNA) hybridizing 
to at least about 15 nucleotides (nt), and more preferably at least about 20 nt. 
siil! more prctciably ai lea >i about 30 nt. and even more preferably about 
^ij.Ti) ( C £ . 5<i> m of the reference polynucleotide. These are useful as 
diagnostic probes and primers as discussed above and in more detail below. 

liy .i p..ni.m ( >f a polynucleotide of "at least 20 nt in length." for 
cvample. is . vemled 20 or more contiguous nucleotides from the nucleotide 
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sequence of the reference polynucleotide («.g., a deposited eDN>\,pr^ 
nucleotide sequence as shown in Figure 1,2 or 3 (SEQ ID NO: 1,3 or 5)). Of 
course, a polynucleotide which hybridizes only to a poly A sequence (such as 
the y terminal poly(A) tract of a f NFR cDNA, or to a complementary stretch 
of T (or L ) residues, w ould not be included in a polynucleotide of the 
invention used to hybridize to a portion of a nucleic acid of the invention, 
since such a polynucleotide would hybridize to any nucleic acid molecule 
containing a poly (A) stretch or the complement thereof (e.g., practically any 
double-stranded cDN A clone). 

As indicated, nucleic acid molecules of the present invention which 
encode a TNFR polypeptide may include, but are not limited to those 
encoding the ammo acid sequence of the mature polypeptide, by itself; and the 
coding sequence for the mature polypeptide and additional sequences, such as 
those encoding the about 26-3 1 amino acid leader or secretory sequence, such 
as a pre-, or pre- or prepro- protein sequence; the coding sequence of the 
mature polypeptide, with or without the aforementioned additional coding 
sequences. 

Also encoded by nucleic acids of the invention are the above protein 
sequences together with additional, non-coding sequences, including for 
example, but not limited to introns and non-coding 5" and 3' sequences, such 
a< the transcribed, non-translated sequences that play a role in transcription, 
mRNA processing, including splicing and polyadcnylation signals, for example 
- nboMjmc binding and stability of mRNA; an additional coding sequence 
which codes for additional amino acids, such as those which provide additional 
functionalities. 

Thus, the sequence encoding the polypeptide may be fused to a marker 
sequence. ^ulH j> a sequence encoding a peptide which facilitates purification 
of the fused polypeptide. In certain preferred embodiments of this aspect of 
iho iu\ JnMt.n. the marker ammo acid sequence r a hexa-histidme peptide, such 
as the i.te provided in a pOi vector ((JIAGItN. Inc.. ^25^ Lion Avenue, 
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Chatsworth, CA, 91311), among others, many of which are commercially 
available. As described in Gcntz ct al. Proc. Natl Acad. Sci. USA <W:821-824 
(1989), for instance, hcxa-histidine provides for convenient purification of the 
fusion protein. The "HA" tag is another peptide useful for purification which 
corresponds to an epitope derived from the influenza hemagglutinin protein, 
which has been described by Wilson ct aL. Celt 37: 767 (1984). As discussed 
below, other such fusion proteins include a TNFR-5. -6a or -6p fused to Fc at 
the N- or C-terminus. 

Variant and Mutant Polynucleotides 

The present invention further relates to variants of the nucleic acid 
molecules of the present invention, which encode portions, analogs or 
derivatives of a TNFR polypeptide. Variants may occur naturally, such as a 
natural allelic variant. By an "allelic variant" is intended one of several 
alternate forms of a gene occupying a given locus on a chromosome of an 
organism. Genes IL Lcwin, B., ed., John Wiley & Sons, New York (1985). 
Non-narurally occurring variants may be produced using art-known 
mutagenesis techniques. 

Such variants include those produced by nucleotide substitutions, 
deletions or additions. The substitutions, deletions or additions may involve 
one or more nucleotides. The variants may be altered in coding regions, 
non-coding regions, or both. Alterations in the coding regions may produce 
conservative or non-conservative amino acid substitution?, deletions or 
additions. Especially preferred among these are silent substitutions, additions 
and deletions, which do not alter the properties and activities of the TNFR 
poK peptide or portions thereof. Also especially preferred in this regard are 
l« tiiserv jtive substitutions. 

Highly preferred are nucleic acid molecules encoding a mature protein 
Im-'mi^ an aimint acid sequence shown in SHQ II) NOS 2. 4 and 6 or the mature 
I NI K pul> peptide sequences encoded by the deposited cl)NA clones. 



Most highly preferred are nucleic acid molecules encoding the 
extracellular domain of a protein having the amino acid sequence shown in SEQ 
ID NO:2. 4 or 6 or the extracellular domain of a TNFR amino acid sequence 
encoded by a deposited cDNA clone. 

Further embodiments include an isolated nucleic acid molecule 
comprising a polynucleotide having a nucleotide sequence at least 90% 
identical, and more preferably at least 95%, 96%. 97%, 98% or 99% identical 
to a pol> nucleotide selected from the group consisting of: (a) a nucleotide 
sequence encoding a TNFR polypeptide having the complete amino acid 
sequence in SEQ ID NO:2, 4 or 6, or as encoded bv a cDNA clone contained in 
ATCC Deposit No. 97798, 97810, or 97809; (b) a nucleotide sequence 
encoding a mature TNFR polypeptide having an amino acid sequence at 
positions 27-259, 31-300 or 31-170 in SEQ ID NO:2, 4 or 6, respectively, or 
as encoded by a cDNA clone contained in ATCC Deposit No. 97798, 97810 
or 97S09; (c) a nucleotide sequence encoding a soluble extracellular domain of a 
TNFR polypeptide having the amino acid sequence at positions 27-240, 31- 
283. and 3 M 66 of SEQ ID NOS:2, 4 and 6, respectively; and (d) a nucleotide 
sequence complementary to any of the nucleotide sequences in (a), (b) or (c) 
above. 

Further embodiments of the invention include isolated nucleic acid 
molecules that comprise a polynucleotide having a nucleotide sequence at least 
90% identical, and more preferably at least 95%, 96%. 97%, 98% or 99% 
identical, to any of the nucleotide sequences in (a), (b), (c), or (d), above, or a 
polynucleotide which hybridi7es under stringent hybridization conditions to a 
polynucleotide in (a), (b). (c), or (d), above. This polynucleotide which 
hyhndi/cs doe> not hybridize under stringent hybridi7ation conditions to a 
polynucleotide having a nucleotide sequence consisting of only A residues or 
of only T residues. An additional nucleic acid embodiment of the invention 
rj.iicv in an isolated nuc leic acid molecule comprising a polynucleotide which 



encodes the amino acid sequence of an cpitope-bearing portion of a TNFR 
polypeptide having an amino acid sequence in (a), (b), (c), or (d). above. 

The present invention also relates to recombinant vectors, which 
include the isolated nucleic acid molecules of the present invention, and to host 
cells containing the recombinant vectors, as well as to methods of making such 
vectors and host cells and for using them for production of TNFR 
polypeptides or peptides by recombinant techniques. 

By a polynucleotide having a nucleotide sequence at least, for example. 
95% "identical" to a reference nucleotide sequence encoding a TNFR 
polypeptide is intended that the nucleotide sequence of the polynucleotide is 
identical to the reference sequence except that the polynucleotide sequence 
may include up to five point mutations per each 100 nucleotides of the 
reference nucleotide sequence encoding the TNFR polypeptide. In other 
words, to obtain a polynucleotide having a nucleotide sequence at least 95% 
identical to a reference nucleotide sequence, up to 5% of the nucleotides in the 
reference sequence may be deleted or substituted with another nucleotide, or a 
number ot nucleotides up to 5% of the total nucleotides in the reference 
sequence may be inserted into the reference sequence. These mutations of the 
reference sequence may occur at the 5* or 3* terminal positions of the reference 
nucleotide sequence or anywhere between those terminal positions, 
interspersed cither individually among nucleotides in the reference sequence or 
in one or more contiguous groups within the reference sequence. 

As a practical matter, whether any particular nucleic acid molecule is at 
least 90° o, 95° o. 96° 97%. 98% or 99% identical to, for instance, a nucleotide 
sequence shown in Figure I, 2 or 3. or to the nucleotides sequence of a 
deposited cDNA clone can be determined conventionally using known 
computer program* such as the Best fit program (Wisconsin Sequence Analysis 
Package. Version X for Unix. Genetics Comp.-tcr Group. University Research 
i'urk. 575 Science Drive. MuJinoii. Wl 5371 1 i. Bcstfit uses the local 
homnlug) algorithm of Smith and Waterman. Advances in Applied Mathematics 



2:482-489 ( 1981), to find the best segment of homology between two 
sequences. When using Bcstfit or any other sequence alignment program to 
determine whether a particular sequence is, for instance, 95% identical to a 
reference sequence according to the present invention, the parameters arc set, 
of course, such that the percentage of identity is calculated over the full length 
of the reference nucleotide sequence and that gaps in homology of up to 5% of 
the total number of nucleotides in the reference sequence arc allowed. 

The present application is directed to nucleic acid molecules at least 
90° 95%. 96%. 97%. 98% or 99% identical to a nucleic acid sequence shown 
in Figure 1.2 or 3 (SEQ ID NO: 1 . 3 or 5) or to the nucleic acid sequence of a 
deposited cDNA, irrespective of whether they encode a polypeptide having 
TNFR activity. This is because even where a particular nucleic acid molecule 
does not encode a polypeptide having TNFR activity, one of skill in the art 
would still know how to use the nucleic acid molecule, for instance, as a 
hybridization probe or a polymerase chain reaction (PCR) primer. Uses of the 
nucleic acid molecules of the present invention that do not encode a 
polypeptide having TNFR activity include, inter alia, (I) isolating a TNFR 
gene or allelic variants thereof in a cDN A library; (2) in situ hybridization (e.g., 
"FISH") to metaphase chromosomal spreads to provide precise chromosomal 
location of the TNFR gene, as described in Verma ct al.. Human Chromosomes: 
A Manual of Basic Techniques, Pergamon Press, New York (1988); and 
Northern Blot analysis for detecting TNFR mRNA expression in specific 
tissues. 

Preferred, however, are nucleic acid molecules having sequences at least 
90%. 95%. 96%, 97%, 98% or 99% identical to a nucleic acid sequence shown 
in Figure 1. 2 or 3 (SEQ ID NO: I. 3 or 5) or to the nucleic acid sequence of a 
deposited cDNA which do. in fact, encode polypeptides having TNFR protein 
activity. By "a polypeptide having TNFR activity" is intended polypeptides 
exhibiting activity similar, but not necessarily identical, to an activity of a 
• mature or extracellular forms of a TNFR-5. -6a or -6(J protein of the 



invention, as measured in a particular biological assay. The TNF family Vv5 
ligands induce various cellular responses by binding to TNF-family receptors, 

V ' - *V 

including the TNFR-5, -6a and -60 of the present invention. Cells which 

express the TNFR proteins are believed to have a potent cellular response to \ ^ 

TNFR-I receptor ligands including B lymphocytes (CD19+), both CD4 and V-., v 

CD8+ T lymphocytes, monocytes and endothelial cells. By a "cellular response ^ 
to a TNF-family ligand" is intended any genotypic, phenotypic, and/or * ' 

morphological change to a cell, cell line, tissue, tissue culture or patient that is 
induced by a TNF-family ligand. As indicated, such cellular responses include 
not only normal physiological responses to TNF-family ligands, but also 

diseases associated with increased cell proliferation or the inhibition of increased ^ 
cell proliferation, such as by the inhibition of apoptosis. !" 

Screening assays for the forgoing are known in the art. One such ; >■ ^ 

screening assay involves the use of cells which express the receptor (fir Sy> # 5 
example, transfected CHO cells) in a system which measures extracellular pH ia*gA 
changes caused by receptor activation, for example, as described in Science 
246: 181-296 (October 1989). For example, a TNF-family ligand may be 
contacted with a cell which expresses the mature form of the receptor 
polypeptide of the present invention and a second messenger response, e.g.. 
signal transduction or pH changes, may be measured to determine whether the 
TNFR polypeptide is active. 

Of course, due to the degeneracy of the genetic code, one of ordinary 
skiil in the art will immediately recognize that a large number of the nucleic 
acid molecules having a sequence at least 90%, 95%, 96%. 97%, 98%. or 99% 

identical to the nucleic acid sequence of a deposited cDN A or the nucleic acid 

sequence shown in Figure 1 . 2 or 3 (SEQ ID NO: 1 , 3 or 5) will encode a 

polypeptide "having TNFR protein activity." In fact, since degenerate 

variants of these nucleotide sequences all encode the same polypeptide, this 

will be clear to the skilled artisan even without performing the above described 

comparison assay. It will be further recognized in the art that, for such nucleic 

acid molecules that arc not degenerate variants, a reasonable number will also 

encode a polypeptide having TNFR protein activity. This is because the 

skilled artisan is fully aware of amino acid substitutions that arc either less 

likely or not likely to significantly effect prote.n function (e.g., replacing one 
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aliphatic amino acid with a second aliphatic amino acid), as further described 
below. 

Vectors and Host Cells 

The present invention also relates to vectors which include the isolated 
DNA molecules of the present invention, host ceils which are genetically 
engineered with the recombinant vectors, and the production of TNFR 
polypeptides or fragments thereof by recombinant techniques. The vector 
may be. for example, a phage, plasmid, viral or retroviral vector. Retroviral 
vectors may b" replication competent or replication defective. In the latter 
case, virai propagation generally will occur only in complementing host cells. 

The polynucleotides may be joined to a vector containing a selectable 
marker for propagation in a host. Generally, a plasmid vector is introduced in 
a precipitate, such as a calcium phosphate precipitate, or in a complex with a 
charged lipid. If the vector is a virus, it may be packaged in vitro using an 
appropriate packaging cell line and then transduced into host cells. 

The DNA insert should be operatively linked to an appropriate 
promoter, such as the phage lambda PL promoter, the E. coli lac, trp.phoA and 
tac promoters, the SV40 early and late promoters and promoters of retroviral 
LTRs. to name a few. Other suitable promoters will be known to the skilled 
artisan. The expression constructs will further contain sites for transcription 
iniriation. termination and. in the transcribed region, a ribosome binding site for 
translation. The coding portion of the transcripts expressed by the constructs 
will preferably include a translation initiating codon at the beginning and a 
termination codon (UAA. UGA or UAG) appropriately positioned at the end 
of the polypeptide to be translated. 

As indicated, the expression vectors will preferably include at least one 
selectable marker. Such markers include dihydrofolate reductase, G418 or 
neomycin resistance for eukaryotic cell culture and tetracycline, kanamycin or 
ympieillin resistance genes for cultunng in E. coli and other bacteria. 
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Rcprescntative examples of appropriate hosts include, but are not limited to, 
bacterial cells, such as E. co/i, Strcptomyces and Salmonella typhimurium cells; 
fungal cells, such as yeast cells; insect cells such as Drosophila S2 and 
Spodoptera Sf9 cells; animal cells such as CHO, COS, 29* and Bowes 
melanoma cells; and plant cells. Appropriate culture mediums and conditions 
for the above-described host cells are known in the art. 



Among vectors preferred for use in bacteria include pQE70, pQE60 
and pQE-9, available from Ql AGEN, Inc., supra; Phagescript vectors, 
Blucscnpt vectors, pNH8A, pNH!6a, pNH18A. pNH46A, available from 
Stratagcne; and ptrc99a. pKK223-3, pKK233-3, pDR540, pRITS available 
from Pharmacia. Among preferred eukaryotic vectors are pWLNEO, 
pSVZCAT, pOG44. pXTl and pSG available from Stratagenc; and pSVK3, 
pBPV. pMSG and pSVL available from Pharmacia. Other suitable vectors will 
be readily apparent to the skilled artisan. Introduction of the construct into 
the host cell can be effected by calcium phosphate transfection, DEAE-dextran 
mediated transfection, cationie lipid-mediatcd transfection, electroporation, 
transduction, infection or other methods. Such methods arc described in many 
standard laboratory manuals, such as Davis et a!.. Basic Methods In Molecular 
Biology (1986). The polypeptide may be expressed in a modified form, such 
as a fusion protein, and may include not only secretion signals, but also 
additional heterologous functional regions. For instance, a region of additional 
amino acids, particularly charged amino acids, may be added to the N-tcrminus 
of the polypeptide to improve stability and persistence in the host cell, during 
purification, or during subsequent handling and storage. Also, peptide 
moieties may be added to the polypeptide to facilitate purification. Such 
regions may be removed prior to final preparation of the polypeptide. The 
addition of peptide moieties to polypeptides to engender secretion or 
excretion, to improve stability and to facilitate purification, among others, arc 
familiar and routine techniques in the art. A preferred fusion protein 
comprises a heterologous region from immunoglobulin that is useful to 
stabilize and purify proteins For example. EP-A-O 464 533 (Canadian 
counterpart 2045X69) discloses fusion proteins comprising various portions of 
constant region of immunoglobulin molecules together with another human 
protein or pan thereof. In many cases, the Fc pail in a fusion protein is 
thoroughly advantageous for use in therapy and diagnosis and thus results, for 
example, in improved pharmacokinetic properties ( EP-A 0232 262). On the 
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other hand, for some uses it would be desirable to be able to delete the Fc part 
after the fusion protein has been expressed, detected and purified in the 
advantageous manner described. This is the case when Fc portion proves to be 
a hindrance to use in therapy and diagnosis, for example when the fusion 
protein is to be used as antigen for immunizations. In drug discovery, for 
example, human proteins, such as hIL-5. have been fused with Fc portions for 
the purpose of high-throughput screening assays to identify antagonists of 
hIL-5. See, D. Bennett et jL> J. Molecular Recognition 5:52-58 ( 1 995) and K. 
Johanson et aL J. Biol. Chem. 270:9459-9471 (1995). 

The TNFR proteins can be recovered and purified from recombinant 
cell cultures by well-known methods including ammonium sulfate or ethanol 
precipitation, acid extraction, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, 
affinity chromatography, hydroxylapatite chromatography and lectin 
chromatography. Most preferably, high performance liquid chromatography 
("HPLC") is employed for purification. Polypeptides of the present invention 
include: products purified from natural sources, including bodily fluids, 
tissues and cells, whether directly isolated or cultured; products of chemical 
synthetic procedures; and products produced by recombinant techniques from 
a prokaryotic or eukaryotic host, including, for example, bacterial, yeast, 
higher plant, insect and mammalian cells. Depending upon the host employed 
in a recombinant production procedure, the polypeptides of the present 
invention may be glycosylated or may be non-glycosylated. In addition, 
polypeptides of the invention may a!:o include an initial modified methionine 
residue, in some cases as a result of host-mediated processes. 

Polypeptides and Fragments 

The invention further provides isolated TNFR polypeptides having 
the ammo acid sequences encoded by the deposited cDNAs. or the amino acid 
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sequences in SEQ ID NOS2, 4 and 6, or a peptide or polypeptide comprising 
a portion of the above polypeptides. 

I ariant and Mutant Polypeptides 

To improve or alter the characteristics of a TNFR polypeptide, 
protein engineering may be employed. Recombinant DNA technology known 
to those skilled in the art can be used to create novel mutant proteins or 
"muteins" including single or multiple amino acid substitutions, deletions, 
additions or fusion proteins. Such modified polypeptides can show, e.g., 
enhanced activity or increased stability. In addition, they may be purified in 
higher yields and show better solubility than the corresponding natural 
polypeptide, at least under certain purification and storage conditions. 

X-Terminal and C-Terminal Deletion Mutants 

For instance, for many proteins, including the extracellular domain of a 
membrane associated protein or the mature form(s) of a secreted protein, it is 
known in the art that one or more amino acids may be deleted from the N- 
terminus or C-terminus without substantial loss of biological function. For 
instance. Ron ct al.../ Btol. CI;cm.. 2^:2984-2988 ( 1993) reported modified 
KGF proteins that had heparin binding activity even if 3, 8, or 27 amino- 
tcrminal amino acid residues were missing. In the present case, since the 
proteins of the invention are members of the TNFR polypeptide family, 
deletions of N-teirmnal amino acids up to the Cysteine at position 53 of SHQ 
ID NO 2 (TNFR-5). and 49 of SHQ ID NOS:4 and 6 <TNFR-6a and -60) may 
retain some biological activity such as regulation of proliferation and apoptosis 
of lymphoid cells Polypeptides having further N -terminal deletions including 
the C 53 rcMduc in SI:Q ID NO:2. or the C4^ residue in Shy ID NOS.4 and 6, 
* >uid n»>t he cxpccicd u> retain such biological activities because it is known 
tlui these residues in a TNFR -related polypeptide are required for forming a 
di>uitidc h-ndgc t« ■ provide structural stability which is needed for receptor 
binding and signal transduction 
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However, even if deletion of one or more amino acids from the N- 
terminus of a protein results in modification of loss of one or more biological 
functions of the protein, other biological activities may still be retained. Thus, 
the ability of the shortened protein to induce and/or bind to antibodies which 
recognize the compleie or extracellular domain of the TNFR protein generally 
will be retained when less than the majority of the residues of the complete 
protein or extracellular domain are removed from the N-terminus. Whether a 
particular polypeptide lacking N-terminal residues of a complete protein 
retains such immunologic activities can readily be determined by routine 
methods described herein and otherwise known in the art. 

Accordingly, the present invention further provides polypeptides 
having one or more residues deleted from the amino terminus of the amino acid 
sequence of the TNFR show n in SEQ ID NOS:2, 4 and 6, up to the cysteine 
residue in each which are at position numbers 53, 49 and 49, respectively, and 
polynucleotides encoding such polypeptides. In particular, the present 
invention provides TNFR -5 polypeptides comprising the amino acid sequence 
of residues m-259 of SEQ ID NO:2 where m is an integer in the range of 1 -53 
where 53 is the position of the first cysteine residue from the N-terminus of 
the complete TNFR-5 polypeptide (shown in SEQ ID NO:2) believed to be 
required for activity of the TNFR-5 protein; and TNFR-6ct and -60 
polypeptides comprising the amino acid sequence of residues n-300 and 170 of 
ShQ ID NOS:4 and 6, respectively where n is an integer in the range of 1-49 
where 49 is the position of the first cysteine residue from the N-tenninus of 
the complete TNFR -6a and -6P polypeptides (shown in SEQ ID NOS:4 and 
6. respectively) believed to be required for activity of the TNFR-6ct and -60 
protein** 

Mure in particular, the invention provides polynucleotides encoding 
polypeptides having the ammo acid sequence of residues: 1-259, 2-259, 3-259, 
4.2^. 5-259. 6-259. 7-259. X-259. 9-259, 10-259. 1 1-259, 12-259, 13-259, 
14-259. 15-259. lf>-259. 17-259, IX-259, 19-259. 20-259,21-259, 22-259. 23- 
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259, 24-259. 25-259, 26-259, 27-259, 28-259, 29-259, 30-259, 3 1-259, 32- 
259, 33-259, 34-259, 35-259, 36-259, 37-259, 38-259, 39-259, 40-259, 41- 
259, 42-259. 43-259, 44-259, 45-259, 46-259, 47-259, 48-259, 49-259, 50- 
259, 51-259, 52-259, and 53-259 of SEQ ID NO:2; 1-300, 2-300, 3-300, 4- 
300, 5-300, 6-300. 7-300. 8-300, 9-300, 10-300, 1 1-300, 12-300, 13-300. 14- 
300, 15-300. 16-300, 17-300, 18-300, 19-300, 20-300, 21-300, 22-3^0, 23- 
300. 24-300. 25-300, 26-300. 27-300, 28-300, 29-300, 30-300, 31-300, 32- 
300, 33-300, 34-300. 35-300, 3o-300. 37-300, 38-300, 39-300, 40-300, 41- 
300. 42-300. 43-300. 44-300, 45-300, 46-300, 47-300, 48-300, and 49-300 of 
S EQ I D NO :4 ; and 1 - 1 70, 2- 1 70, 3- 1 70, 4- 1 70, 5- 1 70, 6- 1 70, 7- 1 70, 8- 1 70, 9- 
170, 10-170. 11-170, 12-170, 13-170, 14-170, 15-170, 16-170, 17-170, 18- 
1 70. 1 9- 1 ~0. 20- 1 70, 2 1 - 1 70, 22- 1 70, 23- 1 70. 24- 1 70, 25- 1 70, 26- 1 70, 27- 
1 70, 28- 1 70. 29- 1 70. 30- 1 70. 3 1 - 1 70, 32- 1 70, 33- 1 70, 34- 1 70. 35- 1 70, 36- 
1 70, 37- i "0, 38- 1 70, 39- 1 70, 40- 1 70, 4 1 - 1 70, 42- 1 70, 43- 1 70, 44- 1 70, 45- 
I 70, 46- 1 70, 47- 1 70, 48- 1 70, and 49- 1 70 of ^EQ ID NO:6. Polynucleotides 
encoding these polypeptides also are provided. 

Similarly, many examples of biologically functional C-terminal deletion 
mutcins are known. For instance, interferon gamma shows up to ten times 
higher activities by deleting 8-10 amino acid residues from the carboxy 
terminu> of the protein (Dobcli et al.,7. Biotechnology 7:199-216 (1988)). In 
the present case, since the protein of the invention is a member of the TNFR 
polypeptide family, deletions of C-terminal amino acids up to the cysteine at 
position 149, 193 and 132 of SEQ ID NOS:2, 4 and 6, respectively, may 
retain some biological activity such as regulation of proliferation and apoptosis 
of lymphoid cells. Polypeptides having further C-terminal deletions including 
the cysteines at poMtions 149. 193. and 132 of SEQ ID NOS.2. 4 and 6, 
respectively, would not be expected to retain such biological activities because 
it is kno i An that these residues in TNF receptor-related polypeptides are 
required fur forming disulfide bridges to provide structural stability which is 
needed lor receptor binding- 
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However, even if deletion of one or more amino acids from the C- 
terminus of a protein lesults in modification of loss of one or more biological 
functions of the protcm. other biological activities may still be retained. Thus, 
the ability of the shortened protein to induce and/or bind to antibodies which 
recognize the complete or mature form of the protein generally will be retained 
when less than the majority of the residues of the complete or mature form 
protein arc removed from the C-tcrminus. Whether a particular polypeptide 
lacking C-ierminal residues of a complete protein retains such immunologic 
activities can readily be determined by routine methods described herein and 
otherwise known in the art. 

Accordingly, the present invention further provides polypeptides 
having one or more residues from the carboxy terminus of the amino acid 
scqucr.ee of TNFR-5, -6a and -6P shown in SEQ ID NOS:2, 4 and 6 up to the 
cysteine at position 149, 193 and 132 of SEQ ID NOS:2, 4 and 6, 
respectively, and polynucleotides encoding such polypeptides. In particular, 
the present invention provides polypeptides having the amino acid sequence 
of residues 1-x, 1-y. and 1-z of the amino acid sequence in SEQ ID NOS:2, 4 
and 6. respectively, where x is any integer in the range of 149-259, where y is 
any integer in the range of 1 93-300. and z is any integer in the range of 1 32- 
1 70. Polynucleotides encoding these polypeptides also are provided. 

The invention also provides polypeptides having one or more amino 
acids deleted from both the amino and the carboxyl termini, which may be 
described generally as having residues m-x of SEQ ID NO:2, n-y of SEQ ID 
NO:4. and n-z of SEQ ID NO:6 where m, n, x, y and z are integers as 
described above. 

Also included arc a nucleotide sequence encoding a polypeptide 
consisting of a portion of a complete TNFR amino acid sequence encoded by a 
cDNA clone contained in ATCC Deposit No 9779X, 97810, or 97809, where 
this portion excludes from I to about 49. 53. or 53 amino acids from the amino 
terminus of the complete amino acid sequence encoded by the cDNA clone 





* 



contained in ATCC Deposit No. 97798, 97810, and 97809, respectively, or 
from 1 to about 1 10, 107, or 58 amino acids from the carboxy terminus of the 
complete ammo acid sequence encoded by the cDNA clone contained in 
ATCC Deposit No. 97798, 97810 and 97809, respectively, or any 
5 combination of the above amino terminal and carboxy terminal deletions, of the 

complete ammo acid sequence encoded by the cDNA clone contained in 
ATCC Deposit No. 97798, 97810 or 97809. Polynucleotides encoding all of 
the above deletion mutant polypeptide form, also are provided. 

Other Mutants 

l0 in addition to terminal deletion forms of the prote... discussed above, it 

also will be recognized by one of ordinary skill in the art that some amino acid 
sequences of the TNFR polypeptides can be varied without significant effect 
on the structure or function of the proteins. If such differences in sequence are 
contemplated, it should be remembered that there will be critical areas on the 

15 protein which determine activity. 

Thus, the invention further includes variations of the TNFR 
polypeptides which show substantial TNFR polypeptide activity or which 
include regions of TNFR protein such as the protein portions discussed below. 
Such mutants include deletions, insertions, inversions, repeats, and type 

:n substitutions selected according to general rules known in the art so as have 

little effect on activity. For example, guidance concerning how to make 
phcnorypically silent amino acid substitutions is provided in Bowie, J. U. et 
al.. "Deciphering the Message in Protein Sequences: Tolerance to Amino Acid 
Substitutions." Science 24 7; I 30ft- 1310(1 990). wherein the authors indicate 

: < that there arc tw o main approaches for studying the tolerance of an amino acid 

sequence to change. The first method relics on the process of evolution, in 
which mutations are either accepted or rejected by natural selection The 
second approach uses genetic engineering to introduce amino acid changes at 
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specific positions of a cloned gene and selections or screens to identify 
sequences that maintain functionality. 

As the authors state, those studies have revealed that proteins arc surprisingly 
tolerant of amino acid substitutions. The authors further indicate which amino 

5 acid changes arc likely to be permissive at a certain position of the protein. 

For example, most buried amino acid residues require nonpoiar siJc chains, 
whereas few features of surface side chains arc generally conserved. Other 
such phcnorypically silent substitutions are described in Bowie. J. U. . ai, 
supra, and the references cited therein. Typically seen as conservative 

10 substitutions arc the replacements, one for another, among the aliphatic amino 

acids Ala. Val, Leu and He; interchange of the hydroxyl residues Ser and Thr. 
exchange of the acidic residues Asp and Glu, substitution between the amide 
residues Asn and Gin, exchange of the basic residues Lys and A;g and 
replacements among the aromatic residues Phc. Tyr. Thus, the fragment. 

15 derivative or analog of the polypeptide of SEQ ID NO:2, 4 or 6, or that 

encoded by a deposited cDNA, may be (i) one in which one or more of the 
amino acid residues are substituted with a conserved or non-conserved amino 
acid residue (preferably a conserved amino acid residue) and such substituted 
amino acid residue may or may not be one encoded by the genetic code, or (ii) 

2,, one in which one or more of the amino :icid residues includes a substituent 

group, or (iii) one in which the mature jr soluble extracellular polypeptide is 
fused w ith another compound, such as a compound to increase the half-life of 
the polypeptide (for example, polyethylene glycol), or (iv) one in which the 
additional amino acids are fused to the above form of the polypeptide, such as 

25 an IgG Fc fusion region peptide or leader or secretory sequence or a sequence 

which is employed for purification of the above form of the polypeptide or a 
proprotein sequence. Such fragments, derivatives and analogs arc decme* 1 to Yq 
within the scope of those skilled in the an from the icachings herein. 

thus, the TNFR of the present invention may include one or more 

-wi amino acid substitutions, deletions or additions, cither from natural mutations 
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TABLE 1. Conservative Amino Acid Substitutions 



Aromatic 



Hydrophobic 



Polar 



Basic 



Acidic 



Small 



Phenylalanine 

Tryptophan 

Tyrosine 

Leucine 

Isolcucinc 

Valine 

Glutaminc 
Asparaginc 

Argininc 

Lysine 

Histidine 

Aspartic Ac»d 
Glutamic Acid 

Alanine 
Serine 
Threonine 
Methionine 
Glycine 



Amino acids in the TNFR proteins of the present invention that are 
essential for function can be identified by methods known in the art, such as 
site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham and 
Wells. Science 1 08 1 - 1 085 ( 1 °89». The latter procedure introduces single 
alanine mutations at every residue in the molecule. The resulting mutant 
molecules arc then tested for biological activity such as receptor binding or in 
vitro or in vitro proliferative activity. 

Of special interest are substitutions of charged amino acids with other 
charged or neutral amino acids which may produce proteins with highly 
desirable improved characteristics, such as less aggregation. Aggregation may 
not only reduce activity but also be problematic when preparing 
pharmaceutical formulations, because aggregates can be immunogenic (Pmckard 
etui. Clin Exp. Immunol 2:331-340(1967); Robbins et al.. Diabetes 36: 838- 



845 (1987); Cleland ct ai. Crit Rev. Therapeutic Drug Carrier Systems 
70:307-377(1993). 

Replacement of amino acids can also change the selectivity of the 
binding of a ligand to cell surface receptors. For example, Ostadc ct ai, Nature 
Jrt 7:266-268 (1993) describes certain mutations resulting in selective binding 
of TNF-a to only one of the two known types of TNF receptors. Sites that 
are critical for ligand-reccptor binding can also be determined by structural 
analysis such as cry stallization, nuclear magnetic resonance or photoatTinity 
labeling (Smith ct ai, J, \fnl. Biol 224: 8 99-904 (1992) and de Vos et ai 
Science '.55:306-3 12(1 992)). 

Since TNFR-5, -6a and -6p are members of the TNF receptor-related 
protein family, to modulate rather than completely eliminate biological 
activities of TNFR preferably mutations are made in sequences encoding 
amino acids in the TNFR conserved extracellular domain, more preferably in 
residues within this region which are not conserved among members of the 
TNF receptor family. Also forming pan of the present invention are isolated 
polynucleotides comprising nucleic acid sequences which encode the above 
TNFR mutants. 

The polypeptides of the present invention arc preferably provided in 
an isolated form, and preferably are substantially purified. A recombinantly 
produced version of the TNFR polypeptides can be substantially purified by 
the one-step method described in Smith and Johnson. Gene 67:3 1-40 (1988). 
Polypeptides of the invention also can be purified from natural or recombinant 
sources using anti-TNFR-5. -6a and -6(3 antibodies of the invention in 
methods which are well known in the art of protein purification. 



•37. 



By a polypeptide having an amino acid sequence at least, for example, 
95% "identical" to a reference amino acid sequence of a TNFR polypeptide is 
intended that the amino acid sequence of the polypeptide is identical to the 
reference sequence except that the polypeptide sequence may include up to 
five amino acid alterations per each 100 amino acids of the reference amino acid 
of the TNFR polypeptide. In other words, to obtain a polypeptide having an 
amino acid sequence at least 95% identical to a reference amino acid sequence, 
up to 5% of the amino acid residues in the reference sequence may be deleted 
or substituted with another amino acid, or a number of amino acids up to 5% 
of the total amino acid residues in the reference sequence may be inserted into 
the reference sequence. These alterations of the reference sequence may occur 
at the amino or carbc:;y terminal positions of the reference amino acid 
sequence or anywhere between those terminal positions, interspersed cither 
individually among residues in the reference sequence or in one or more 
contiguous groups within the reference sequence. 

As a practical matter, whether any particular polypeptide is at least 
90%, 95° o. 9r»%, 97%. 98% or 99% identical to, for instance, the amino acid 
sequence shown in SEQ ID NO:2, 4 or 6, or to an amino acid sequence 
encoded by a deposited cDNA clone can be determined conventionally using 
known computer programs such the Bestfit program (Wisconsin Sequence 
Analysis Package. Version 8 for Unix, Genetics Computer Group, University 
Research Park, 575 Science Orive, Madison, Wl 537 1 1 ). When using Bestfit 
or any other sequence alignment program to determine whether a particular 
sequence is. for instance, 95% identical to a reference sequence according to the 
present invention, the parameters are set. of course, such that the percentage 
of identity is calculated over the full length of the reference amino acid 
sequence and that gaps in homology of up to 5% of the total number of amino 
actd residue b in the reference sequence are allowed. 



The polypeptide of the present invention could be used as a molecular 
weight marker on SDS-PAGE gels or on molecular sieve gel filtration columns 
using methods w ell known to those of skill in the art. 

As described in detail below, the polypeptides of the present invention 
can also be used to raise polyclonal and monoclonal antibodies, which arc 
useful in assays for detecting TNFR protein expression as described below or 
as agonists and antagonists capable of enhancing or inhibiting TNFR protein 
function. Further, such polypeptides can be used in the yeast two-hybrid 
system to "capture" TNFR protein binding proteins which are also candidate 
agonists and antagonists according to the present invention. The yeast two 
hybrid system is described in Fields and Song, Nature 340:245-246 (1989). 

Epitope-Bearing Portions 

In another aspect, the invention provides a peptide or polypeptide 
comprising an epitope-bearing portion of a polypeptide of the invention. The 
epitope of this polypeptide portion is an immunogenic or antigenic epitope of 
a polypeptide of the invention. An "immunogenic epitope" is defined as a 
part of a protein that elicits an antibody response when the whole protein is 
the immunogen. On the other hand, a region of a protein molecule to w hich an 
antibody can bind is defined as an "antigfnic epitope." The number of 
immunogenic epitopes of a protein gemraiiy is less than the number of 
antigenic epitopes. See. for instance, Geysen et a?.. Proc. Nad. Acad. Sci. USA 
*/:399S- 4002 (1983). 

As to the selection of peptides or polypeptides bearing an antigenic 
epitope (i.e., that contain a region of a protein molecule to which an antibody 
can bind), it is well known in thai art that relatively short synthetic peptides 
that mimic part of a protein sequence arc routinely capable of eliciting an 
antiserum that reacts with the partially mimicked protein. See, for instance, 
Sutcliffc. J. G., Shinnick, T. M , Green, N. and Learner, R. A. (1983) 
" Antibodies that react with predetermined sites on proteins " Science. 
jyV:^t)-6/S6 Peptides capable of eliciting protein-reactive sera are frequently 



represented in the primary sequence of a protein, can be characterized by a set 
of simple chemical rules, and are confined neither to immunodominant regions 
of intact proteins (i.e., immunogenic epitopes) nor to the amino or carboxyl 
terminals. Antigenic epitopc-bearing peptides and polypeptides of the 
invention are therefore useful to raise antibodies, including monoclonal 
antibodies, that bind specifically to a polypeptide of the invention. See. for 
instance, Wilson et aL Cell 57:767-778 (1984) at 777. 

Antigenic epitope-bearing peptides and polypeptides of the invention 
preferably contain a sequence of at least seven, more preferably at least nine 
and most preferably between about 15 to about 30 amino acids contained 
within th; amino acid sequence of a polypeptide cf the invention. 
Non-limiting examples of antigenic polypeptides or peptides that can be used 
to generate TNFR-specific antibodies include: a polypeptide comprising amino 
acid residues from about Gln-42 to about Glu-52, from about His-58 to about 
Cys-66, from about Pro-68 to about Thr-76, from about Ser-79 to about Cys- 
85, from about Cys-91 to about Thr-102, from about Gin- 110 to about Pro- 
122, from about Arg-126 to about Val-136, and from about Thr-142 to about 
Glu-148 in SEQ ID NO:2; from about Ala-3l to about Thr-46. from about 
Phe-57 to about Thr-1 17, from about Cys-132 to about Thr-175, from about 
Gly-185 to about Thr-194, from about Val-205 to about Asp-217. from about 
Pro-239 to about Leu-264, and from about Ala-283 to about Pro-298 in SEQ 
ID NO:4; and from about Ala-31 to about Thr-46. from about Phe-57 to about 
GIn-RO, from about Glu-86 to about His-106, from about Thr-108 to about 
Phe-1 19, from about His- 129 to about Val-138. and from about Gly-142 to 
about Pro- 1 66 in SEQ ID NO:6. These polypeptide fragments have been 
determined to bear antigenic epiu-pes of the TNFR-5. -6a and -6p 
polypeptides respectively, by the analysis of the Jameson-Wolf antigenic index, 
as *hown in Figures 5, 6 and 7, above. 

The epitope-bearing peptides and polypeptides of the invention may 
be produced by any conventional means. See, e.g., Houghtcn, R. A. (1985) 
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"General method for the rapid solid-phase synthesis of large numbers of 
peptides: specificity of antigen-antibody interaction at the level of individual 
amino acids " Proc Sail. Acad. ScL USA ^2:5131-5135; this "Simultaneous 
Multiple Peptide Synthesis (SNIPS)" process is further described in U.S. 
Patent No. 4.631.21 1 to Houghtcn ct al. (1986). 

Epitope-beanng peptides and polypeptides of the invention are used 
to induce antibodies according to methods well known in the art. See, for 
instance. Sutcliffe et al'.. supra: Wilson et al., supra; Chow, M. et al., Proc. 
Sail. Acad. Sci. USA $2:9 10-9 14; and Bittle, F. J. et aL J Gen. Virol. 
6/5:2347-2354 (1985). Immunogenic epitope-bearing peptides of the 
invention, i.e., those parts of a protein that elicit an antibody response when 
the whole protein is the immunogen, are identified according to methods 
known in the art. See. for instance. Geysen et al., supra. Further still. U.S. 
Patent No. 5.194.392 to Geysen (1990) describes a general method of 
detecting or determining the sequence of monomers (amino acids or other 
compounds) which is a topological equivalent of the epitope (i.e., a 
"mimotope") which is complementary to a particular paratope (antigen binding 
site) of an antibody of interest. More generally, U.S. Patent No. 4,433.092 to 
Geysen (1989) describes a method of detecting or determining a sequence of 
monomers which is a topographical equivalent of a ligand which is 
complementary to the ligand binding site of a particular receptor of interest. 
Similarly. U.S. Patent No. 5.4S0.971 to Houghten, R. A. et aL (1996) on 
Feralkylated Oligopeptide Mixtures discloses linear C l-C7-alkyl peralkylated 
oligopeptides and sets and libraries of such peptides, as well as methods for 
using such oligopeptide sets and libraries for determining the sequence of a 
peralkylated oligopeptide that preferentially binds to an acceptor molecule of 
interest Thus, non-peptide analogs of the epitope-beanng peptides of the 
invention also can be made routinely by these methods. 



the production of sera containing polyclonal antibodies. In a preferred 
method, a preparation of TNFR protein is prepared and purified to render it 
substantially free of natural contaminants. Such a preparation is then 
introduced into an animal in order to produce polyclonal antiscra of greater 
specific activity. 

In the most preferred method, the antibodies of the present invention 
are monoclonal antibodies. Such monoclonal antibodies can be prepared using 
hybridoma technology (Kohler ct aL Nature 256:495 (1975); Kohlcr ct aL 
Eur. J. Immunol. 6:511 (1976); Kohlcr ct aL Eur. J. Immunol. 6:292 ( 1976); 
Hammcrling ct aL in: Monoclonal Antibodies and T-Ccll Hyhridomas. 
Elsevier, N.Y., (1981) pp. 563-68! ). In general, such procedures involve 
immunizing an animal (preferably a mouse) with a TNFR protein antigen or. 
more preferably, with a TNFR protein-expressing cell. Suitable cells can be 
recognized by their capacity to bind anti-TNFR-5, -6a Oi -6p protein 
antibody. Such cells may be cultured in any suitable tissue culture medium; 
however, it is preferable to culture cells in Earlc's modified Eagle's medium 
supplemented with 10% fetal bovine serum (inactivated at about 56° C), and 
supplemented with about 10 g/l of nonessential amino acids, about 1.000 U 'ml 
of penicillin, and about 100 ug'ml of streptomycin. The splcnocytcs of such 
mice are extracted and fused with a suitable myeloma cell line. Any suitable 
myeloma cell line may be employed in accordance with the present invention; 
however, it is preferable to employ the parent myeloma cell line (SP20), 
available from the American Type Culture Collection, Rockviile. Maryland. 
After fusion, the resulting hybndoma cells arc selectively maintained in HAT 
medium, and then cloned by limiting dilution as described by Wands ct al. 
(Gastroenterology 80:225-232 ( 19X1 )). The hybridoma cells obtained through 
such a selection are then assayed to identify clones which secrete antibodies 
capable of binding the desired TNFR antigen. 

' " Alternatively, additional antibodies capable of binding to the TNFR 
antigen may be produced in a two-step procedure through the use of 



anti-idiorypic antibodies. Such a method makes use of the fact that antibodies 
arc themselves antigens, and that, therefore, it is possible t obtain an antibody 
which binds to a second antibody. In accordance with this method, TNFR- 
protcin specific antibodies arc used to immunize an animal, preferably a 
mouse. The splcnocytcs of such an animal arc then used to produce 
hybridoma cells, and the hybridoma cells are screened to identify clones which 
produce an antibody whose ability to bind to the TNFR protein-specific 
antibody can be blocked by the TNFR protein antigen. Such antibodies 
comprise anti-idiorypic antibodies to the TNFR protein-specific antibody and 
can be used to immunize an animal to indue? formation of further TNFR 
protein-specific antibodies. 

It w ill be appreciated that Fab and F(ab')2 and other fragments of the 
antibodies of the present i: vention may be used according to the methods 
disclosed herein. Such fragments are typically produced by proteolytic 
cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin 
(to produce F(ab')2 fragments). Alternatively, TNFR protein-binding 
fragments can be produced through the application of recombinant DNA 
technology or through synthetic chemistry. 

For in vivo use of anti-TNFR in humans, it may be preferable to use 
"humanized" chimeric monoclonal antibodies. Such antibodies can be 
produced using genetic constructs derived from hybridoma cells producing the 
monoclonal antibodies described above. Methods for producing chimeric 
antibodies are known in th= art. See, for ;e view , Morrison. Science 229: \ 202 
(1985); Oi et al., BioTcchniqucs 4:214 (1986); Cabilly et al., U.S. Patent No. 
4,816.567; Taniguchi et al., EP 171496; Momson et al., EP 173494; Neubergcr 
et al., WO 8601533; Robinson et al.. WO 8702671 ; Boulianne ct ai. Saturc 
312:643 ( 1 9X4); Neuberger et aL Saturc i/-/:268 (1985). 
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Where a diagnosis of a disorder in the immune svstcm including 
diagnosis of i. .u.nor has already been made according to conventional methods, 
the present invention is useful as a prognostic indicator, whereby patients 
exhibiting depressed gene expression will experience a worse clinical outcome 
relative to patients expressing the gene at a level nearer the standard level 

By "assaying the expression level of the gene encoding a TNFR 
protein" is intended qualitatively or quantitatively measuring or estimating the 
level of the TNFR-5. -6a and/or -6P protein or the level of the mRNA 
encoding the TNFR-5, -oa and/or -op protein in a first biological simple cither 
directly leg., by determining or estimating absolute protein level or mRNA 
level) or relatively (e.g., by comparing to the TNFR protein level or mRNA 
level in a second biological sample). Preferably, the TNFR protein level or 
mRNA level in the first biological sample is measured or estimated and 
compared to a standard TNFR protein level or mRNA level, the standard being 
taken from a second biological sample obtained from an individual not having 
the disorder or being determined by averaging levels from a population of 
individuals not having a disorder of the immune system As will be 
appreciated in the art, once standard TNFR protein lex els or mRNA levels are 
known, they can be used repeatedly as a standard for comparison. 

By "biological >ample" is intended any biological >ample obtained from 
an individual, body fluid, cell line, tissue culture, or other source which 
contains TNFR piotem or mRNA. As indicated, biological samples include 
body fluids (such a- -era. pla>ma. urine. <>no\ial fluid and spinal fluid) which 
contnm free extracellular J -.unto (or -Suable tonn(s)) of a TNFR protein. 

; - icr :->ue source- tound to express complete or 
> \i-j* ; iods for orMainmt: ti^ue biopsies and 
•v.!.- .it.- ac. - -own in the an W he:e the biological 
- . is the ctci i cd >ouree. 

\. vi-c of a ucne of the present 
. t 1 i»t e\.:mple. if a 
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mutation is present in one of the genes of the present invention, conditions 
would result from a lack of production of the receptor polypeptides of the 
present invention Further, mutations which enhance receptor polypeptide 
activity would lead to diseases associated with an over expression of the 
receptor polypeptide, e.g.. endotoxic shock. Mutations in the genes can be 
detected by comparing the sequence of the defective gene with that of a nonnal 
one. Subsequently one can verily that a mutant gene is associated with a 
disease condition or the susceptibility to a disease condition. That is, a mutant 
gene which leads to the undcrcxprcssion of the receptor polypeptides of the 
present invention would be associated with an inability of TNF to inhibit 
tumor growth. 

Other immune system disorders which may be diagnosed by the 
foregoing assays include hypersensitivity, allergy, infectious disease, graft- 
host disease, immunodeficiency, autoimmune diseases and the like. 

Individuals carrying mutations in the genes of the present invention 
may be detected at the DN A level by a variety of techniques. Nucleic acids 
used for diagnosis may be obtained from a patient's cells, such as from blood, 
urine, saliva and tissue biopsy among other tissues. The genomic DN A may 
be used directly for detection or may be amplified cnzymatieally by using 
HCR (Saiki et al . Nature. 324: 163-166 ( 19S6)j prior to analysis. RNA or 
cDN A may also be used for the same purpose. As an example. PCR primers 
complementary to the nucleic acid of the instant invention can be used to 
identify and analy7e mutations in the human genes of the present invention. 
For example, deletions and insertions can be detected by a change in the size of 
the amplified product in comparison to the rormal genotype. Point mutations 
can be identified by hybridi7ing amplified DNA to radiolabeled RNA or 
alternate el>. radiolabeled antisense DNA sequences of the present invention. 
PerfectU inched sequences can be distinguished tivin iniMu.itv.lied duplex 
by RNase A digestion or by differences in mclum: u nip. kuuu- Siu h a 
diagnose would be particularly u>efu! !• •< pun 1,1 ■ wn MO " ,l ' 



such as deuterium, which may be incorporated into the antibody by labeling of 
nutrients for the relevant hybridoma. 

A TNFR-spccific antibody or antibody fragment which has been 
labeled with an appropriate detectable imaging moiety, such as a radioisotope 

5 (for example, m I. ,i: In, ^Tc), a radio-opaque substance, or a material 

detectable by nuclear magnetic resonance, is introduced (for example, 
parcntcrally. subcutaneously or intrapcntoncally) into the mammal to be 
examined for immune system disorder. It will be understood in the an that the 
size of the subject and the imaging system used will determine the quantity of 

in imaging moiety needed to produce diagnostic images. In the case of a 

radioisotope moiety, for a human subject, the quantity of radioactivity injected 
will normally range from about 5 to 20 millicuncs of " m Tc. The labeled 
antibody or antibody fragment will then preferentially accumulate at the 
location of cells w hich contain TNFR protein. In vivo tumor imaging is 

1 5 described in S. W. Burchicl et at., "Immunopharmacokinctics of Radiolabeled 

Antibodies and Their Fragments" (Chapter 13 in Tumor Imaging: The 
Radiochemical Detection of Cancer. S.W. Burchiel and B. A. Rhodes, eds., 
Masson Publishing Inc. (1982)). 



Treatment 

The Tumor Necrosis Factor (TNF) family ligands are known to be 
among the most plciotropic cytokines, inducing a large number of cellular 
responses, including cytotoxicity, anti-viral activity, immunoregulatory 
activities, and the transcriptional regulation of several genes (Goeddel, D.V et 
u/- "Tumor Necrosis Factors: Gene Structure and Biological Activities." 
Symp. Quant Biol 51:597-609 (19X6). Cold Spring Harbor; Bcutler, B., and 
Cerami, A.. Annu. Rev. Bun-hem. 57:505-518 (1988); Old, L.J.. Sci. Am. 
25*. 59-75 (I9SX); Hers. \\\. h'EBS Lett. 2*5: 1 99-224 (1991)). The TNK- 
family ligands induce such various cellular responses by binding to TNh- 
family receptors. Cells which express a TNFR polypeptide and have a potent 
cellular response to TNFR-5. -6a and -6p ligands include lympnocytes, 
endothelial cells, keratinocytes. and prostate tissue. By "a cellular response to 
a TNF -family hgar.J" is intended any genotypic, phenotypic, and or 



of the present invention by contacting the mclanophorc cells which encode the 
receptor with both a TNF-family ligand and the candidate antagonist (or 
agonist). Inhibition or enhancement of the signal generated by the ligand 
indicates that the compound is an antagonist or agonist of the ligand/rcccptor 
signaling pathway. 

Other screening techniques include the use of celiS which express the 
receptor (for example, transfeced CHO cells) in a system which measures 
extracellular pH changes caused by receptor activation, for example, as 
describe* in Science 246:\ 81-296 (October 1989). For example, compounds 
•^ay be contacted with a cell which expresses the receptor polypeptide of the 
present invention and a second messenger response, e.g.. signal transduction or 
pH changes, may be measured to determine whether the potential compound 
activates or inhibits the receptor. 

Another such screening technique involves introducing RNA encoding 
the receptor into Xcnopus oocytes to transiently express the receptor. The 
receptor oocytes ma> then be contacted with the receptor ligand and a 
compound to be screened, followed by detection ->f inhibition or activation of a 
calcium signal in the case of screening for compounds which are thought to 
inhibit activation of the receptor. 

Another screening technique involves expressing in cells a construct 
wherein the receptor is linked to a phospholipasc C or D. Such cells include 
endothelial cells, smooth muscle cells, embryonic kidney cells, etc. The 
screening may be accomplished as hereinabove described by detecting 
activation of the receptor or inhibition of activation of the receptor from the 
phospholipasc signal. 

Anothe. method involves screening for compounds which inhibit 
activation of the receptor polypeptide of the present invention antagonists by 
determining inhibition of binding of labeled ligand to cells which have the 
receptor on the surface thereof. Such a method involves transfecting a 
cukaryotic cell with DNA encoding the receptor such that the cell expresses 
the receptor on its surface and contacting the cell with a compound in the 
presence of a labeled form of a known ligand. The ligand can be labeled, e.g.. 
by radioactivity The amount of labeled ligand bound to the receptors is 
measured, e.e . by measuring radioactivity of the receptors. If the compound 
binds to the receptor as determined by a reduction of labeled ligand which 
binds to the receptors, the binding of labeled ligand to the receptor is inhibited. 
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Further screening assays for agonist and antagonist of the present • 
invention are described in Tartaglia, L.A., and GoeddcK D.V., / Biol. Chcm. 
267f7>:4304-4307(1992). 

Thus, in a further aspect, a screening method is provided for 
determining whether a candidate agonist or antagonist is capable of enhancing 
or inhibiting a cellular response to a TNF-family ligand. The method involves 
contacting ceils which express the TNFR polypeptide with a candidate 
compound and a TNF-family ligand, assaying a cellular response, and 
comparing the cellular response to a standard cellular response, the standard 
being assayed when contact is made with the ligand in absence of the candidate 
compound, whereby an increased cellular response ovc. the standard indicates 
that the candidate compound is an agonist of the ligand/rcceptor signaling 
pathway and a decreased cellular response compared to the standard indicates 
that the candidate compound is an antagonist of the ligand/rcceptor signaling 
pathway. By "assaying a cellular response" is intended qualitatively or 
quantitatively measuring a cellular response to a candidate compound and'or a 
TNF-family ligand (e.g., determining or estimating an increase or decrease in T 
cell proliferation or tritiated thymidine labeling). By the invention, a cell 
expressing the TNFR polypeptide can be contacted with either an endogenous 
or exogenously administered TNF-family ligand. 

Agonist according to the present invention include naturally occurring 
and synthetic compounds such as, for example, TNF family ligand peptide 
fragments, transforming growth factor, neurotransmitters (such as glutamatc. 
dopamine. V-methyl-D-aspartate), tumor suppressors (p53). cytolytic T cells 
and antimetabolites. Preferred agonist include chemothcrapcutic drugs such as. 
for example, cisplatin, doxorubicin, bleomycin, cytosine arabinoside. nitrogen 
mustard, methotrexate and vincristine. Others include cthanol and -amyloid 
peptide. (Science 1 457- I45X (1995)). Further preferred agonist include 
polyclonal and monoclonal antibodies raised against the TNFR polypeptide, 
or a fragment thereof. Such agonist antibodies raised against a TNF family 
receptor are disclosed in Tartaglia. L.A.. el al.. Proc. Sati Acad. ScL USA 
*.V:9292-9296 ( 1991 ); and Tartaglia. L A., and Gocddcl. D.V., ./. Bit>t. Chcm. 
4304-4307 ( 1992) See. also. PCT Application WO 94 09137. 
Antagonist according to the present invention include naturally 
incurring and synthetic compounds such as. for example, the CD40 ligand. 
neutral ammo acids, /inc. estrogen, androgens, viral genes (such as Adenovirus 
/-.//*; U>iculowru> and ZAP. ( mvpox virus crmA. Lpstcm-Harr vims 

HUHI I IMP I African swine tever xirus IMU5-IIL. and Herpesvirus yl 



34,5), calpain inhibitors, cysteine protease inhibitors, and tumor promoters 
(such as PMA, Phcnobarbital. and -Hcxachlorocyclohcxanc). Other 
antagonists include polyclonal and monoclonal antagonist antibodies raised 
against the TNFR polypeptides or a fragment thereof. Such antagonist 
antibodies raised against a TNF-family receptor arc described in Tartaglia, 
LA, and Goeddcl. D.V., J. Biol. Chcm. 2*7^:4304-4307 (1992) and 
Tartaglia. L.A. et aL Cell 7j:213-216 (1993). Sec, also, PCT Application WO 
94,09137. 

Other potential antagonists include antiscnse molecules. Antiscnsc 
technology can be used to control gene expression through antiscnsc DNA or 
RNA or through mple-hclix formation. Antiscnsc techniques arc discussed, 
for example, in Okano, . Ncuroehcm. 5/5:560 v 199 1); Oligodcoxy nucleotides as 
Antiscnsc Inhibitors of Gene Expression. CRC Press, Boca Raton. FL (1988). 
Triple helix formation is discussed in, for instance Lee et aL Nucleic Acids 
Research 6:3073 ( 1979); Tooncy et aL Science 241:456 (1988); and Dcrvan ct 
aL Science 251: 1360 (1991). The methods arc based on binding of a 
polynucleotide to a complementary DNA or RNA. 

For example, the 5' coding portion of a polynucleotide that encodes the 
mature polypeptide of the present invention may be used to design an 
antiscnse RNA oligonucleotide of from about 10 to 40 base pairs in length. A 
DNA oligonucleotide is designed to be complementary to a region of the gene 
involved in transcription thereby preventing transcription and the production 
of the receptor. The antiscnsc RNA oligonucleotide hybridizes to the mRNA 
in vivo and blocks translation of the mRNA molecule into receptor 
polypeptide. The oligonucleotides described above can also be delivered to 
cells such that the antiscnsc RNA or DNA may be expressed in vivo to inhibit 
production of the receptor. 

Further antagonist according to the present invention include soluble 
forms of TNFR. i.e., TNFR fragments that include the ligand binding domain 
from the extracellular region of the full length receptor. Such soluble forms of 
the receptor, which may be naturally occurring or synthetic, antagonize TNFR 
mediated signaling by competing with the cell surface TNFR for binding to 
TNF-family ligands. Thus, soluble forms of the receptor that include the 
ligand binding domain are novel cytokines capable of inhibiting tumor necrosis 
induced by TNF-family ligands. Other such cytokines are known in the art 
and include Fas B (a soluble form of the mouse Fas receptor) that acts 
physiologicalU to limit apoptosis induced by Fas ligand (Hughes. D P. and 
Crispe. I.N ,J tlyp Med /*.?: I 395- 1 401 (IW5)). 




As indicated polyclonal and monoclonal antibody agonist or antagonist 
according to the present invention can be raised according to the methods 
disclosed in Tartaglia. L.A.. and Goeddel. D.V.. J. Biol. Chen, 2«7f 7*4304- 
4307(1992): Tartaelia. L.A. a oL Cell 73:213-216 (1993). and PCT 
^plication WO 94 09137. The term "antibody" (Ab) or "monoclonal 
ant.bodv" (mAb) as used herein is mean, to include intact molecules as well as 
fragments thereof (such as. for example. Fab and F(ab') ; fragments) which are 
capable of binding an antigen. Fab and F (ab') : fragments lack the Fc fragment 
of .ntact antibodv. clear more rapidly from .he circulation, and may have less 
non-specific tissue binding of an intac, antibody (Wahl d at.. J. NucL Meet. 

_V:316-325 (1983)). 

Antibodies according to the present invention .nay be prepared by any 
of a variety of methods described above, and known in the art 

Protc.ns and other compounds which bind the extracellular domains are 
also candidate aeor.ist and antagonist according to the present invention. Such 
binding compounds can be "captured" using the yeast two-hybnd system 
(Fields" and Song. Xumre 340:245-246 (1989)). A modified version of the yeast 
rwo-hvbrid system has been described by Roger Brent and his colleagues 
(Gyuris. i. ct al.. Cell 75:791-803 (1993); Zcrvos. A.S. ct at.. Celt 72:223-232 

(1993)). .. , 

Bv a "TNF-family ligand" is intended naturally occurring, recombinant, 
and svnthetic ligands that arc capable of binding to a member of the TNF 
receptor family and inducing the ligand'reccptor signaling pathway. Members 
of the TNF liuand family include, but arc not limited to. the TNFR-5. -6a & - 
6[5 ligands. TNF -a . lymphdtox.n-a (LT-a . also known as TNF-(J ). LT-0. 
FasL~ CD40. CD27. CD30. 4-1BB. OX40 and nerve growth factor (NGF). 

Representative therapeutic applications of the present invention arc 
d.scussed in-more detail below. The state of immunodeficiency that defines 
AIDS is secondary to a decrease in the number and function of CD4 
T-lymphocy,cs. Recent reports estimate the da.ly loss of CD4" T cells to be 
between 3.5 X 107 and 2 X 109 cells (Wei X.. el at.. Xaiurc 373:1 17-122 
(1995)). One cause of CD4" T cell depletion in the setting of HIV infection is 
believed to be I UV-induced apoptos.s. Indeed. IHV-induccd apoptot.c cell 
death has been demonstrated not only in vitro but also, more importantly, in 
infected individuals (Ame.sen. J.C.. AIDS *! 197-1213 (1994, ; F.nkel. T.H.. 
andflanda. NX.. Cur. Opin. Immunol. rf:r,05-M5< 1995): Muro-Cacho. C.A. 
,,,„/ j Immunol /.^:5555-556h ( IWII Furthermore, apoptosis and CIM 
I-Kmphocvte depletion .s tightly correlated in diffcrcn. animal models of 



AIDS<Brunnci\T.. ct aL Nature J7J:441-444 (1995); Gougcon, MX., ct aL 
AIDS Res. Hum. Retroviruses 9:553-563 (1993)) and, apoptosis is not 
observed in those animal models in which viral replication docs not result in 
AIDS (Gougcon, Mi. et a/.. AIDS Res. Hum. Retroviruses 9:553-563 (1993)). 
Further data indicates that uninfected but primed or activated T lymphocytes 
from HIV-infected individuals undergo apoptosis after encountering the TNF- 
family ligand FasL. Using monocytic cell lines that result in death following 
HIV infection, h has been demonstrated thot infection of U937 cells with HIV 
results in the Je novo expression of FasL and that FasL mediates !IIV-induccd 
apoptosis (Badlcy. A.D. et aL J. Virol 70:199-206 (1996)). Further the 
TNF-family ligand was detectable in uninfected macrophages and its 
expression was uprcgulatcd following HIV infection resulting in selective 
killing of uninfected CD4 T-lymphocytcs (Badley, A.D ct aL J. Virol. 70: 1 99- 
206 (1996)). Thus, by the invention, a method for treating HIV* individuals is 
provided which involves administering an antagonist of the present invention 
to reduce selective killing of CD4 T-lymphocytcs. Modes of administration 
and dosages arc discussed in detail below. 

In rejection of an allograft, the immune system of the recipient animal 
has not previously been primed to respond because the immune system for the 
most part is only pnmcd by environmental antigens. Tissues from other 
members of the same species have not been presented in the same way that, 
for example, viruses and bacteria have been presented. In the case of allograft 
rejection, immunosuppressive regimens are designed to prevent the immune 
system from reaching the effector stage. However, the immune profile of 
xenograft rejection may resemble disease recurrence more that allograft 
rejection. In the case of disease recurrence, the immune system has already 
been activated, as evidenced by destruction of the native islet cells. Therefore, 
in disease recurrence the immune system is already at the effector stage. 
Agonist of the present invention are able to suppress the immune response to 
both allografts and xenografts because lymphocytes activated and 
differentiated into effector ceils will express the TNFR polypeptide, and 
thereby are susceptible to compounds which enhance TNFR activity. Thus, 
the present invention further provides a method for creating immune privileged 
tissues. Antagonist of the invention can further be used in the treatment of 
Inflammatory Bowel-Disease. 



Formulations 

The TNFR polypeptide composition will be formulated and dosed in a 
fashion consistent with good medical practice, taking into account the clinical 
condition of the individual patient (especially the side effects of treatment 
with TNFR-5, -6a or -6(3 polypeptide alone), the site of delivery of the 
TNFR polypeptide composition, the method of administration, the scheduling 
of adminisrration, and other factors known to practitioners. The "effective 
amount" of TNFR polypeptide for purposes herein is thus determined by 
such considerations. 

As a general proposition, the total pharmaceutical^ effective amount 
of TNFR polypeptide administered parentcrally per dose will be in the range 
of about I ug Vg'day to 10 mg/kg'day of patient body weight, although, as 
noted above, this will be subject to therapeutic discretion. More preferably, 
this dose is at least 0.01 mg "kg/day, and most preferably for humans between 
about 0.01 and 1 mg kg/day for the hormone. If given continuously, the TNFR 
P ol> peptide is typically administered at a dose rate of about 1 ug/kg/hour to 
about 50 uu VgTiour, cither by 1 -4 injections per day or by continuous 
subcutaneous infusions, for example, using a mini-pump. An intravenous bag 
solution may also be employed. The length of treatment needed to observe 
changes and the interval following treatment for responses to occur appears to 
vary depending on the desired effect. 

Pharmaceutical compositions containing the TNFR of the invention 
may be administered orally, rcctally. parcnterally. intracistcmally, 
intravaginally. intrapcriu neally. topically (as by powders, ointments, drops oi 
transdermal patch). " ucally, or as an oral or nasal spray. By 
' pharmaceutical!) acceptable carrier*' is meant a non-toxic solid, semisolid or 
liquid filler, dilue-t. encapsulating material or formulation auxiliary of any 
type." The term "parenteral" as used herein refers to modes of administration 



which include intravenous, intramuscular, intraperitoneal, intrastcmal, 
subcutaneous and intraarticular injection and infusion. 

The TNFR polypeptide is also suitably administered by sustained- 
release systems. Suitable examples of sustained-release compositions include 
scini-pc-mr-olc polymer matrices in the form of shaped articles, e.g.. films, or 
mirocapsulcs. Sustaincd-rclcasc matrices include polylactidcs (U.S. Pat. No. 
3,773,919. EP 58,481 ), copolymers of L-glutamic acid and gamma-cthyt-L- 
glutamatc (Sidman. U. ct al., Biopohmcrs J2:547-556 (1983)), poly (2- 
hydroxycthyl mcthacrylatc) (R. Langer et al., J. Biomcd. Mater. Res /5:167- 
277 (1981). and R. Langer, Chcm. Tech. 72:98-105 ( 1982)), ethylene vinyl 
acetate (R. Langer ct a!.. Id.) or poly-D- (-)-3-hydroxybutyric acid (EP 
1 33.988). Sustaincd-rclcasc TNFR polypeptide compositions also include 
l.posomally entrapped TNFR polypeptides. Liposomes containing TNFR 
^jlype* . ics arc prepared by methods known per se: DE 3.218,121; Epstein 
cx al.. Proc. Natl. Acad. Sci. (USA) 82:3688-3692 ( 1985): Hwang et al., Proc. 

all. Acad. Sci. (USA) 77:4030-4034 (1980); EP 52,322; EP 36,676; EP 
SN.046; EP 143.949; EP 142.641; Japanese Pat. Appl. 83-1 18008; U.S. Pat. 
Nos. 4.485.045 and 4.544.545; and EP i 02.324. Ordinarily, the liposomes are 
of ihc small (abnut 200-800 Angstroms) unilamellar type in which the lipid 
content is greater tha.i about 30 mol. percent cholesterol, the selected 
proportion being adjusted for the optimal TNFR polypeptide therapy. 

For parenteral administration, in one embodiment, the TNFR 
polypeptide is formulated generally by mixing it at the desired degree of 
purity, in a unit dosage injectable form (solution, suspension, or emulsion), 
with a pharmaceutical^ acceptable carrier, i.e., one that is non-toxic to 
recipients at the dosages and concentrations employed and is compatible with 
other ingredients of the formulation. For example, the formulation preferably 
does not include oxidizing agents and other compounds that are known to be 
deleterious to polypeptides. 



Generally, the formulations arc prepared by contacting the TNFR 
polypeptide uniformly and intimately with liquid carriers or finely divided 
solid carriers or both. Then, if necessary, the product is shaped into the 
desired formulation. Preferably the carrier is a parenteral carrier, more 
preferably a solution that is isotonic with the blood of the recipient. Examples 
of such earner vehicles include water, saline. Ringer's solution, and dextrose 
solution. Non aqueous * chicles such as fixed oils and ethyl oleate arc also 
useful herein, as- well as liposomes. 

The carrier suitably contains minor amounts of additives such as 
substances that enhance isotonicity and chemical stability. Such materials arc 
non-toxic to recipients at the dosages and concentrations employed, and 
include buffers such as phosphate, citrate, succinate, acetic acid, and other 
organic acids or their salts; antioxidants such as ascorbic acid; low molecular 
weight (less than about ten residues) polypeptides, e.g., polyargininc or 
tripeptides; proteins, such as scrum albumin, gelatin, or immunoglobulins; 
hydrophilic polymers such as polyvinylpyrrolidone; amino acids, such ,>s 
glvcine. glutamic acid, aspartu acid, or argininc; monosaccharides, 
disaeehandes. and other carbohydrates including cellulose or its derivatives, 
glucose, manose, cr dcxtrins; chelating agents such as EDTA; sugar alcohols 
such as mannitol or sorbitol; countenons such as sodium; and or nonionic 
surfactants such as polysorbates. poloxamcrs. or PEG. 

The TNKR polypeptide is typically formulated in such vehicles at a 
concentration of about 0. 1 mg ml to 100 mg ml. preferably 1 - i0 mg ml. at a pH 
of about 3 to 8. It will be understood that the use of certain of the f.rcgoing 
excipicnts. carriers, or stabilizers will result in the formation of TNFR 
polypeptide salts. 

TNFR polypeptides to be used for therapeutic administration must be 
Steele. Sterility is readily accomplished by filtration through sterile filtration 
membranes ve.g.. 0.2 micron membranes). Therapeutic TNFR polypeptide 
compositions generally are placed into a con'aincr having a sterile access port. 



Examples 

Example 1(a): Expression and Purification of "His-tawed" Extracellular 
form of TSFR-5 in £. coii 

The bacicnal expression vector pQE9 (pDIO) is used for bacterial 
expression in this example. (QIAGEN, Inc., 9259 Eton Avenue, Chatsworth, 
CA. 91311). pQE9 encodes ampicillin antibiotic resistance ("Ampr") and 
contains a bacteria! origin of replication ("orD. an IPTG inducible promoter, a 
nbosomc binding site ("RBS"). six codons encoding histidinc residues that 
allow affinity purification using mckcl-nitrilo-tri-acctic acid ("Ni-NTA ) 
affinity resin sold by QIAGEN, Inc.. supra, and suitable single restriction 
enzyme cleavage sites. These elements are arranged such that an inserted 
DNA fragment encoding a polypeptide expresses that polypeptide with the 
six His residues (i.e.. a "6 X His tag") covalently linked to the amino terminus 
of that polypeptide. 

The DNA sequence encoding the desired portion of the TNFR-5 
protein comprising the extracellular form of the TNFR-5 amino acid sequence 
is amplified from the deposited cDNA clone using PCR oligonucleotide 
primers which anneal to sequence encoding the amino terminal sequences of 
the desired portion of the TNFR-5 protein and to carboxy terminal sequences 
of the desired portion of the extracellular form of the TNFR-5 protein in the 
deposited cDNA. Additional nucleotides containing restriction sites to 
facilitate cloning in the pQE9 vector are added to the 5' and 3* primer 
sequences, respectively. 

For cloning the extracellular form of the TNFR-5 protein, the 5* primer 
has the sequence 5' CGC GGATCC ACCACTGCCCGGCAGGAG 3' (SEQ 
ID NO: 25) containing the underlined BamHI restriction site followed by IS 
nucleotides of the amino terminal coding sequence of the extracellular TNFR-5 
. . sequence in SEQ ID NO:2. One cf ordinary skill in the art would appreciate, 
of course, that the point in the protein coding sequence where the 5' primer 



begins and where the 3' pnmcr ends may be varied to amplify a DNA segment 
encoding any desired portion of the complete TNFR-5 protein shorter or 
longer than the extracellular form of the protein. The 3* primer has the 
<rqiicnce 5» GCG TCT AG A CTAGTA ATGAG A AGACGCAGG 3* (SEQ ID 
NO:26) containing the underlined Xbal restriction site followed by 18 
nucleotides complementary to the 3' end of cDNA encoding the extracellular 
domain of the TNFR-5 protein in Figure 1. 

The amplified TNFR-5 DNA fragment and the vector pQE9 arc 
digested with Bam! II and Xbal and the digested DNAs arc then Hgated 
together. Insertion of the TNFR-5 DNA into the restricted pQE9 vector 
places the TNFR-5 protein coding region downstream from the IPTG- 
induciblc promoter and in-frame with an initiating AUG and the six histidinc 
codons. 

The ligation mixture is transformed into competent £. colt cells using 
standard procedures such as those described in Sambrook et a!.. Molecular 
damns: a Laboratory Manual. 2nd Ed.: Cold Spring Harbor Laboratory 
Press. Cold Spring Harbor, NY (1989). E. colt strain M15 rep4. containinu 
multiple copies of the plasmid pRHP4. which expresses the lac repressor and 
confers kanamycin resistance ("Kanr"). is used in carrying out the illustrative 
example described herein. This strain, which is only one of many that are 
suitable for expressing TNFR-5 protein, is available commercially from 
QIAGEN. Inc.. supra. Transformants arc identified by their ability to grow 
on LB plates in the presence of ampicillin and kanamycin. Plasmid DNA is 
isolated from resistant colonics and the identity of the cloned DNA confirmed 
by restriction analysis. PCR and DNA sequencing. 

Clones containing the desired constructs are grown overnight ("O V) 
in liquid culture in LB media supplemented with both ampicillin (100 ug ml) 
- -and kanamycin (25 ug ml). The O N culture is used to inoculate a large 
culture, at a dilution of approximately 1 :25 to 1 :250. The cells are grown to an 



Example 1(b): Expression and Purification ofTNFR*6a and -6p in £. 
coli 

The bacterial expression vector pQE60 is used for bacterial expression 
in this example (QIAGEN, Inc. 9259 Eton Avenue, Chatsworth, CA, 9131 !). 
pQE60 encodes ampicillin antibiotic resistance ("Ampr") and contains a 
bacterial origin of replication ("ori"), an 1PTG inducible promoter, a ribosomc 
binding site ("RBS"), six codons encoding histidinc residues that allow afTniry 
punfication using nickcl-nitrilo-tri-acetic acid ("Ni-NTA") afTinity resin sold 
by QIAGEN, Inc., supra, and suitable single restriction enzyme cleavage sites. 
These elements are arranged such that a DNA fragment encoding a 
polypeptide may be inserted in such as way as to produce that polypeptide 
with the six His residues (i.e., a "6 X His tag") covalcntly linked to the 
carboxyl terminus of that polypeptide. However, in this example, the 
polypeptide coding sequence is inserted such that translation of the six His 
codons is prevented and, therefore, the polypeptide is produced with no 6 X 
His tag. 

The DNA sequences encoding the desired portions of TNFR-6a and - 
6p proteins comprising the mature forms of the TNFR-6a and -60 amino acid 
sequences are amplified from the deposited cDNA clones using PCR 
oligonucleotide primers which anneal to the amino terminal sequences of the 
desired portions of the TNFR-5, -6a or -6p proteins and to sequences in the 
deposited constructs 3* to the cDN A coding sequence. Additional nucleotides 
containing restriction sites to facilitate cloning in the pQE60 vector arc added 

to the 5' and 3' sequences, respectively. 

For cloning the mature form of the TNFR-6a protein, the 5* primer 

has the sequence 5* CGC CC ATGG CAGAAACACCCACCTAC 3' (SEQ ID 

NO:27) containing the underlined Ncol restriction site. One of ordinary skill 
"in the art would appreciate, of course, that the point in t!ie protein coding 

sequence where the 5' primer begins may be varied to amplify a desired 
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portion of the complete protein shorter or longer than the mature form. The 3 % 
primer has the sequence 5" CGC AAGCTT CTCTTTCAGTGCAAGTG 3' 
(SEQ ID NO:2S) containing the underlined I Iindlll restriction site. For cloning 
the mature form of the TNFR-60 protein, the 5' primer has the sequence of 
SEQ ID NO:2 7 above, and the 3" primer has the sequence 5* 
CGC AAGCTT CTCCTCAGCTCCTGCAGTG 3' (SEQ ID NO:29) 
containing the underlined Hindi!! restriction site. 

The amplified TNFR-6ct and -63 DNA fragments and the vector 
pQE60 arc digested with Nco! and Hindi!! and the digested DNAs arc then 
licatcd together. Insertion of the TNFR-6ot and -6p DNA into the restricted 
pQE60 vector places the TNFR-6ct and -6|J protein coding region including its 
associated stop codon downstream from the IPTG-induciblc promoter and in- 
frame with an initiating AUG. The associated stop codon prevents translation 
of the six histidine codons downstream of the insertion point. 

The ligation mixture is transformed into competent E. colt cells using 
standard procedures such as those described in Sambrook et a!.. Molecular 
Cloning: a Laboratory Manual. 2nd Ed.; Cold Spring Harbor Laboratory 
Press. Cold Spring Harbor. NY (1989). E. coli strain Ml 5 rep4. containing 
multiple copies of the plasmid pREP4. which expresses the lac repressor and 
confers kanamycin resistance ("Kanr"), is used in carrying out the illustrative 
example described herein. This strain, which is only one of many that are 
suitable for expressing TNFR-6ct or -6p protein, is available commercially 
from QIAGEN. Inc.. supra. Transformants arc identified by their ability to 
grow on LB plates in the presence of ampicillin and kanamycin. Plasmid DNA 
is isolated from resistant colonics and the identity of the cloned DNA 
continued by restriction analysis. PCR and DNA sequencing. 

Clones containing the desired constructs arc grown overnight ("O N") 
in liquid culture in LB media supplemented with both ampicillin ( 100 jig ml) 
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and kanamycin (25 ug/ml). The OTM culture is used to inoculate a large 
culture, at a dilution of approximately 1:25 to 1 :250. The cells arc grown to an 
optical density at 600 nm ( H OD600 M ) of between 0.4 and 0.6. isopropyl-b-D- 
thiogalactopyranosidc ("IPTG") is then added to a final concentration of I 
mM to induce transcription from the lac repressor sensitive promoter, by 
inactivating the lacl repressor. Cells subsequently arc incubated further for 3 
to 4 hours. Cells then arc harvested by ccntrifugation. 

To purify- the TNFR-6ot and -6p polypeptide, the cells arc then stirred 
for 3-4 hours at 4 C C in 6M guanidine-HCl. pi I 8. The cell debris is removed 
by ccntrifugation, and the supernatant containing the TNFR-6ct and -60 is 
dialyzcd against 50 mM Na-acctatc buffer pH 6, supplemented with 200 mM 
NaCI. Alternatively, the protein can be succcssfuily refolded by dialyzing it 
against 500 mM NaCI. 20°o glycerol 25 mM Tris/HCl pH 7.4. containing 
protease inhibitors. After renaturation the protein can be purified by ion 
exchange, hydrophobic interaction and size exclusion chromatography. 
Alternatively, an affinity chromatography step such as an antibody column 
can be used to obtain pure TNFR-6a and -6p protein. The purified protein is 
stored at 4 % C or frozen at -80° C. 

The following alternative method may be used to purify TNFR-5. -6a 
or -6P expressed in E volt when it is present in the form of inclusion bodies. 
Unless otherwise specified, all of the follow ing steps are conducted at 4-lO r C. 

Upon completion of the production phase of the E. coii fermentation, 
the cell culture is cooled to 4-10 c C and the cells arc harvested by continuous 
ccntrifugation at 1 5.000 rpm (I leraeus Sepatech). On the basis of the expected 
yield of protein per unit weight of cell paste and the amount of purified 
protein required, an appropriate amount of cell paste, by weight, is suspended 
in a buffer solution containing 100 mM Tris. 50 mM hDTA. pi I 7.4. The 
cells are dispersed to a homogeneous suspension using a high shear mixer. 
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The cells ware then lyscd by passing the solution through a 
microfluidizcr (Microfuidics, Corp. or APV Gaulin, Inc.) twice at 4000-6000 
psi. The homogenate is then mixed with NaCI solution to a final concentration 
of 0.5 M NaCI, followed by ccntrifrgation at 7000 xg for 15 min. The 
resultant pellet is washed again using 0.5M NaCI, 100 mM Tris, 50 mM 
EDTA. pH 7.4. 

The resulting washed inclusion bodies arc solubilizcd with 1.5 M 
guanidinc hydrochloride (GuHCl) for 2-4 hours. After 7000 xg centrifugation 
for 1 5 min., the pellet is discarded and the TNFR-5, -6a or -60 
polypeptidc-containing supernatant is incubated at 4°C overnight to allow 
further GuHCl extraction. 

Following high speed centrifugation (30,000 xg) to remove insoluble 
panicles, the GuHCl solubilizcd protein is refolded by quickly mixing the 
GuHCl extract with 20 volumes of buffer containing 50 mM sodium. pH 4.5, 
150 mM NaCI, 2 mM EDTA by vigorous stirring. The refolded diluted 
protein solution is kept at 4°C without mixing for 12 hours prior to further 
purification steps. 

To clarify the refolded tnf recep'^ polypeptide solution, a 
previously prepared tangential filtration unit equipped with 0.16 urn 
membrane filter u ith appropriate surface area (e.g., Filtron), equilibrated w ith 
40 mM sodium acetate, pi J 6.0 is employed. The filtered sample is loaded 
onto a cation exchange resin (e.g., Poros HS-50. Pcrseptivc Biosystems). The 
column is washed with 40 mM sodiur.. icetate. pll 6.0 and eluted with 250 
mM. 500 mM. 1000 mM. and 1500 mM NaCI in the same buffer, in a 
stepwise manner. The absorbanee at 2H0 mm of the effluent is continuously 
monitored. Fractions are collected and further analyzed by SDS-PAGE. 

Fractions containing the TNF receptor polypeptide are then pooled 
and mixed vwth 4 volumes of water. The diluted sample is then loaded onto a 
previously prepared set of tandem columns of strong amoi. (Poros 110-50. 



Perceptive Biosystcms) and weak anion (Poros CM-20. Pcrscptive 
Biosystems) exchange resins. The columns arc equilibrated with 40 mM 
sodium acetate, pi I 6.0. Both columns arc washed with 40 mM sodium 
acetate. pH 6.0. 200 mM NaCl. The CM-20 column is then clutcd using a 10 
column volume linear gradient ranging from 0.2 M NaCl, 50 mM sodium 
acetate, pll 6.0 to 1.0 M NaCL 50 mM sodium acetate. pH 6.5. Fractions arc 
collected under constant A ZM monitoring of the effluent. Fractions containing 
the TNFR-5. -6a or -60 polypeptide (determined, for instance, by 16% SDS- 

PAGH) arc then pooled. 

The resultant TNF receptor polypeptide exhibits greater than 95% 
purity after the above refolding and purification steps. No major contaminant 
bands arc observed from Commassie blue stained 16% SDS-PAGE gel when 5 
tig of purified protein is loaded. The purified protein is also tested for 
endotoxin LPS contamination, and typically the LPS content is less than 0.1 
ng ml according to LAL assays. 

Example 2: Cloning and Expression of TXFR-5, -6a and -6$ proteins in a 
Baculovirus Expression System 

In this illustrative example, the plasmid shuttle vector pA2 is used to 
insert the cloned DNA encoding complete protein, including its naturally 
associated secretory signal (leader) sequence, into a baculovirus to express the 
mature TNKR-5. -6a or -6(5 protein, using standard methods as described in 
Summers et al.A Manual oj ''Methods for Baculovirus lectors and Insect Cell 
Culture Procedures. Texas Agricultural Experimental Station Bulletin No. 
I 555 ( I yS7). 1 his expression vector contains the strong polyhedrin promoter 
of the Autographa cali/untna nuclear polyhedrosis virus (AcMNPV) followed 
by convenient restriction sites such as Rami II. Xba I and Asp718. The 
poKadenylation site of the simian virus 40 <"SV40 M ) is used for efficient 
poK.idenylaiion For easy selection of recombinant virus, the plasmid 
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contains the bcta-galactosidasc gene from E. colt under control of a weak 
Drosophila promoter in the same orientation, followed by the polyadcnylation 
signal of the polyhcdrin gene. The inserted genes a-c Hanked on both sides by 
viral sequences for c jll-mcdiatcd homologous recombination with wild-type 
" viral DNA to generate a viable vims that express the cloned polynucleotide. 

Many other baculovirus vectors could be used in place of the vector 
aiovc. such as pAc373. pVL941 and pAclM I . as one skilled in the art would 
readily appreciate, as long as the construct provides appropriately located 
signals for transcription, translation, secretion and the like, including a signal 
peptide and an in-frame AUG as required. Such vectors are described, for 
instance, in Luckow c! a!.. Virology 7 70:31-39 ( \9*<*). 

The cDNA sequence encoding the full length TNFR-5, -6a or -6p 
protein in a deposited clone, including the AUG initiatio i codon and the 
naturally associated teader sequence shown in SEQ ID NO:2, 4. or 6 is 
amplified using VCR oligonucleotide primers corresponding to the 5* and 3' 
sequenc-s of the gene. The 5* primer for TNFR-5 has the sequence 
5* CGC TCTAGA CCGCCATCATGGCCCGCATCCCCAAG 3* (SEQ ID 
NO:30i containing the underlined Xbal restriction enzyme site The 5* primer 
for TM'R-K* and -f>[3 has the sequence 

y CCC G G A T C C GCC A TC A TG AGG GCG TGG A GG GG CC AG 3' (SEQ 
ID NO 3 1 > containing the underlined Bam! II restriction enzyme site. Ail of 
the pre\ iousIv desenhedprimers encode an efficient signal tor initiation of 
truncation in eukaryotic cells, as described by Kozak. M.. J. \toi Ban. 
/V6 947-950 {I9X7|. The 3' primer for TNFR-5 has the sequence 
5' G C ( • T C T A G A CT A G T A A TC i A C i A A G A GG C AG ( i 3' (SEQ ID NC):32) 
containing the underlined Xba! restriction site. The 3* primer for TNFR-na 
has the sequence 5* CGCGGTACCCTCTTTCAGTGCAAGTG 3' I SEQ ID 
St; 33 i containing the underlined Asp~|x restriction site. The 3' primer for 
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TNFR-6P has the sequence 

5- CGC GGTACC CTCCTCAGCTCCTGCAGTG 3' (SEQ ID NO:34) 
containing the underlined Asp7 18 restriction site. 

The amplified fragment is isolated from a 1% agarose gel using a 
commercially available kit ("Gcncclcan." BIO 101 Inc.. La Jolla. Ca.). The 
fragment then is digested with the appropriate restriction enzyme for each of 
the primers used, as specified above, and again is purified on a 1 ? o agarose gel. 

The plasmid is digested with the same restriction enzymes and 
optionally, can be dephosphorized using calf intestinal phosphatase, using 
routine procedures known in the art. The DNA is then isolated from a 1% 
agarose gel using a commercially available kit ("Gcncclcan" BIO 101 Inc.. La 
Jolla. Ca ). 

The fragment and dephosphoryhted plasmid arc ligated together with 
T4 DNA ligasc. E. coli I IB 101 or other suitable E. colt hosts such as XL- 1 
Blue tStatagene Cloning Systems. La Jolla. CA) cells arc transformed with the 
ligation mixture and spread on culture plates. Bacteria arc identified that 
contain the plasmid with the human TNF receptor gene by digesting DNA 
from individual colonies using the enzymes used immediately above and then 
analyzing the digestion product by gel electrophoresis. The sequence of the 
cloned fragment is confirmed by DNA sequencing. This plasmid is designated 
herein P A2-TNFR-5. pA2-TNFR-6a or P A2TNFR-*P (collectively pA2- 
TNFR). 

Five ng of the plasmid pA2-TNFR is co-transfected with 1.0 ug of a 
commercially available linearized baculov.ru:> DNA ("BaculoGold™ 
baculovirus DNA", Phanningen. San Diego. CA). us.ng the lipofection method 
described by Feigner et al.. Prov. Sail Acad Sci. USA 741 3-741 7 (19X7). 
One pig of BaculoGold™ virus DNA and 5 ug of the plasmid pA2-TNFR are 
mixed in a sterile well of a microliter plate containing 50 ^1 of serum-free 
Grace's medium (Life Technologies Inc.. Gaithersburg. MD). Afterwards. 10 



Ill Lipofcctin plus 90 Grace's medium arc added, mixed and incubated for 15 
minutes at room temperature Then the transfection mixture is added drop- 
wise to Sf9 insect cells ( ATCC CRL 1 711 ) seeded in a 35 mm tissue culture 
plate with I ml Grace's medium without scrum. The plate is then incubated 
for 5 hours at 27° C. The transfection solution is then removed from the plate 
and I ml of Grace's insect medium supplemented with 10% fetal calf scrum is 
added. Cultivation is then continued at 27° C for four days. 

After four days the supernatant is collected and a plaque assay is 
performed, as described by Summers and Smith, supra. An agarose gel with 
"Blue Gal" (Life Technologies Inc., Gaithcrsburg) is used to allow easy 
tdcntification and isolation of gal-cxprcssing clones, which produce blue- 
stained plaques. (A detailed description of a "plaque assay" of this type can 
also be found in the user's guide for insect cell culture and baculovirology 
distributed by Life Technologies Inc., Gaithcrsburg, page 9-10). After 
appropriate incubation, blue stained plaques are picked with the tip of a 
micropipets (e.g.. Eppendorf). The agar containing the recombinant viruses 
is then resuspended in a microcentrifuge tube containing 200 \x\ of Grace's 
medium and the suspension containing the recombinant baculovin.s is used to 
infect SW cells seeded in 35 mm dishes. Four days laf.-r the supernatants of 
these culture dishes are harvested and then they arj stored at 4° C 

To verify- the expression of 'he TNF receptor gene Sf9 cells are grown 
in Grace's medium supplemented with I0°o heat-inactivated FBS The cells 
are infected w ith the recombinant baculovirus at a multiplicity of infection 
("MOD of about 2. If radiolabeled proteins are desired. 6 hours later the 
medium is removed and is replaced with SF900 11 medium minus methionine 
and cysteine (available from Life Technologies Inc.. Rockville. MD). After 42 
hours. 5 uCi of "S-methionine and 5 ^Ci >? S-cysteine (available from 
Amersham) arc added. The cells are further incubated for 16 hours and then 
are harvested by centnfugation. The proteins in the supernatant as well as the 



intracellular proteins arc analyzed by SDS-PAGE followed by 
autoradiography (if radiolabeled). 

Microscqucncing of the amino acid sequence of the amino terminus of 
punficd protein may be used to determine the amino terminal sequence of the 
mature form of the TNF receptor protein. 

Example 3: Cloning and Expression of TSFR-S. -6a and -6P in 

Mammalian Cells 

A typical mammalian expression vector contains the promoter clement, 
which mediates the initiation of transcription of mRNA. the protein coding 
sequence, and signals required for the termination of transcription and 
poiyadcnylation of the transcript. Additional elements include enhancers. 
Kozak sequences and interv ening sequences flanked by donor and acceptor 
sites for RNA splicing. Highly efficient transcription can be achieved with the 
early and late promoters from SV40. the long terminal repeats (LTRs) from 
Retroviruses, e.g.. RSV. HTLVI. HI VI and the early promoter of the 
cytomegalovirus (CMV). However, cellular elements can also be used (e.g.. 
the human actin promoter). Suitable expression vectors for use in practicing 
the present invention include, for example, vectors such as pSVL and pMSG 
(Pharmacia. I ppsa'a. Sweden). pRSVcat (ATCC 37152). pSVZdhfr (ATCC 
37146) and pBCI2MI (ATCC 67109). Mammalian host cells that could be 
used include, human Hela. 293. H9 and Jurkat cells, mouse NIH3T3 and C127 
cells. Cos 1. Cos 7 and CV1. quail QC 1-3 cells, mouse L cells and Chinese 
hamster ovary (CHO) cells. 

Alternatively, the gene can be expressed in stable cell lines that contain 
the gene integrated into a chromosome. The co-transfection with a selectable 
marker such as dhfr. gpt. neomycin, hygromycin allows the identification and 
isolation of the transfected cells. 
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a termination codon and polyadcnylation signal arranged so that a cDNA can 
be conveniently placed under expression control of the CM V promoter and 
opcrably linked to the SV40 intron and the polyadcnylation signal by means of 
restriction sites in the polylinkcr. The HA tag corresponds to an epitope 
derived from the influenza hemagglutinin protein described by Wilson ct a!.. 
Ceil 37: 767 (1984). The fusion of the HA tag to the target protein allows 
easy detection and recovery of the recombinant protein with an antibody that 
recognizes the HA epitope. pcDNAHI contains, in addition, the selectable 
neomycin marker. 

A DNA fragment encoding the complete TNF receptor polypeptide is 
cloned into the polylinkcr region of the vector so that recombinant protein 
expression is directed by the CMV promoter. The plasmid construction 
strategy is as follows. The TNF receptor cDNA of a deposited clone is 
amplified using primers that contain convenient restriction sites, much as 
described abov e for construction of vectors for expression of a TNF receptor 
in E. eoii. Suitable primers include the following, which are used in this 
example. The 5' primer for TNFR-5, containing the underlined EcoRI site, has 
the following sequence: 

5* CGCGAATTCCGCCATCATGGCCCGGATCCCCAAG 3' (SEQ ID 
NO:35). The 3* primer, containing the underlined Xbal site, has the following 
sequence: 5' GCGTCTAGAGTAATGAGAAGAGGCAGG 3' (SEQ ID 
NO:36>. 

The PCR amplified DNA fragment and the vector, pcDNAI/Amp, are 
digested with Xbal and EcoRI and then ligated. The ligation mixture is 
transformed into E. cod strain SURE (available from Stratagcne Cloning 
Systems. 1 1099 North Torrey Pines Road, La Joila, CA 92037), and the 
transformed culture is plated on ampicillin media plates which then are 
incubated to allow growth of ampicillin resistant colonies. Plasmid DNA is 
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isolated from resistant colonics and examined by restriction analysis or other 
means for the presence of the fragment encoding the TNFR-5 polypeptide 

For expression of recombinant TNFR-5, COS cells arc transfected with 
an expression vector, as described above, using DEAE-DEXTRAN, as 
described, for instance, in Sambrook ct al.. Molecular Cloning: a Laboratory 
Manual Cold Spring Laboratory Press, Cold Spring Harbor. New York 
(1989). Cells arc incubated under conditions for expression of TNFR-5 by the 
vector. 

Expression of the pTNFR-5-HA fusion protein is detected by 
radiolabeling and immunoprccipitation, using methods described in, for 
example Harlow et al.. Antibodies: A Laboratory Manual 2nd Ed. ; Cold Spr ng 
Harbor Laboratory Press. Cold Spring Harbor, New York (1988). To this end, 
two days after transfection. the cells are labeled by incubation in media 
containing • 5 S-cystcine for 8 hours. The cells and the media arc collected, and 
the cells are washed and the lyscd w ith detergent-containing RIPA buffer: 150 
mM NaCl. 1% NP-40, 0.1% SDS, 1% NP-40, 0.5% DOC, 50 mM TRIS, pH 
7.5. as described by Wilson et al. cited above. Proteins are precipitated from 
the cell lysate and from the culture media using an HA-specific monoclonal 
antibody. The precipitated proteins then arc analyzed by SDS-PAGE and 
autoradiography. An expression product of the expected size is seen in the 
cell lysate, which is not seen in negative controls. 

Example 3(b): Cloning and Expression in CHO Cells 

The vector pC4 is used for the expression of TNFR-5, -6a and -6p 
polypeptides. Plasmid pC4 is a derivative of the plasmid pSV2-dhfr (ATCC 
Accession No. 37146). The plasmid contains the mouse DHFR gene under 
control of the SV40 early promoter. Chinese hamster ovary- or other cells 
lacking dihydrofolate activity that are transfected with these plasmids can be 
' "selected by growing the cells in a selective medium (alpha minus MEM, Life 
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Technologies) supplemented with the chcmothcrapcutic agent methotrexate. 
The amplification of the DHFR genes in cells resistant to methotrexate (MTX) 
has been well documented (see, e.g., Alt, F. W., Kcllcms, R. M„ Bcrtino, J. R., 
and Schimkc, R. T., 1978. J. BioL Chem. 1 357-1 370, Hamlin, J. L. and 
Ma, C. 1990. Biochem. et Biophys. Acta. 1097: 107-143, Page, M. J. and 
Sydenham. M. A. 1 99 1 , Biotechnology 9:64-68). Cells grown in increasing 
concentrations of MTX develop resistance to the drug by overproducing the 
target enzyme. DHFR, as a result of amplification of the DHFR gene. If a 
second cenc is linked to the DHFR gene, it is usually co-amplified and over- 
expressed. It is known in the art that this approach may be used to develop 
cell lines carrying more than 1,000 copies of the amplified gcnc(s). 
Subsequently, when the methotrexate is withdrawn, cell lines arc obtained 
which contain the amplified gene integrated into one or more chromosome(s) of 
the host cell. 

Piasmid pC4 contains for expressing the gene of interest the strong 
promoter of the long terminal repeat (LTR) of the Rouse Sarcoma Virus 
(Cullen. et aL Molecular and Cellular Biology. March 1985:438-447) plus a 
fragment isolated from the enhancer of the immediate early gene of human 
cytomegalovirus (CMV) (Boshart et aL Cell 47:521-530 (1985)). 
Downstream of the promoter are the following single restriction enzyme 
cleavage sues that allow the integration of the genes: BamHl, Xba I, and 
AspTl 8. Behind these cloning sites the piasmid contains the 3* intron and 
polyadenylation site of the rat preproinsulin gene. Other high efficiency 
promoters can also be used for the expression, e.g., the human B-actin 
promoter, the SV40 early or late promoters or the long terminal repeats from 
other retroviruses, e.g., HIV and HTLVI. Clontech's Tet-Off and Tet-Ch. 
gene expression systems and similar systems can be used to express the TNF 
receptor polypeptide in a regulated way in mammalian cells (Gossen. M., & 
Bujard. H 1992, Proc. Natl. Ac ad. Sci. USA #9:5547-555 1 ). For the 



polyadcnylation of the mRNA other signals, e.g., from the human growth 
hormone or globin genes can be used well. Stable cell lines carrying a gene of 
interest integrated into the chromosomes can also be selected upon co- 
transfection with a selectable marker such as gpt, G41 8 or hygromycin. It is 
advantageous to use more than one selectable marker in the beginning, e.g., 
G41S plus methotrexate. 

The plasmid pC4 is digested with the restriction enzymes appropriate 
for the specific primers used to amplify the TNF receptor of choice as 
outlined below and then dcphosphorylatcd using calf intestinal phosphates by 
procedures known in the art. The vector is then isolated from a 1% agarose 
gel 

The DNA sequence encoding the TNF receptor polypeptide is 
amplified using PCR oligonucleotide primers corresponding to the 5* and 3* 
sequences of the^ked portion of the gene. The 5* primer for TNFR-5 
containing the underlined Xbal site, has the following sequence: 
5* CGCTCTAGACCGCCATCATGGCCCGGATCCCCAAG 3' (SEQ ID 
NO:37). The 5* primer for TNFR-6a and -6p containing the underlined 
BamHI site, has the following sequence: 

5" CGC GG ATCC GCCATCATGAGGGCGTGGAGGGGCCAG 3' (SEQ 
ID NO:31 ). The 3' primer for TNFR-5. containing the underlined Xbal site, 
has the following sequence: 

5' GCG TCTAGA CTAGTAA TGAOAAGAGGCAGG 3' (SEQ ID NO:38). 
The 3' pnmer for TNFR-6u has the sequence 

5" CGCGGTACCCTCTT rCACTGCAAGTG 3' (SEQ ID NO:33) 
containing the underlined Asp718 restriction site. The 3' primer for TNFR-6P 
has the sequence 5* CGC GGTACC CTCCTCAGCTCCTGCAGTG 3' (SEQ 
ID NO:34) containing the underlined Asp718 restriction site. 



The amplified fragment is digested with the cndonuclcascs which will 
cut at the engineered restriction sitc(s) and then purified again on a 1% agarose 
gel. The isolated fragment and the dcphosphorylatcd vector are then ligatcd 
with T4 DNA hgasc. E coli HBIOI or XL-1 Blue cells arc then transformed 
and bacteria arc identified that contain the fragment inserted into plasmid pC4 
using, for instance, restriction enzyme analysis. 

Chinese hamster ovary celts lacking an active DHFR gene are used for 
transfection. Five-jig of the expression plasmid pC4 is cotransfectcd with 0.5 
jig of the plasmid pSVnco using lipofectin (Feigner ct a!., supra). The plasmid 
pSV2-neo contains a dominant selectable marker, the neo gene from Tn5 
encoding an enzyme that confers resistance to a group of antibiotics including 
G41 8. The cells are seeded in alpha minus MEM supplemented with I mg'ml 
G418. After 2 days, the cells are trypsinized and seeded in hybridoma cloning 
plates (Greiner. Germany) in alpha minus MEM supplemented with 10, 25, or 
50 ng ml of mctothrexate plus 1 mg'ml G418. After about 10-14 days single 
clones are trypsinized and then seeded in 6-well petri dishes or 10 ml flasks 
using different concentrations of methotrexate (50 nM. 100 nM, 200 nM, 400 
nM. 800 nM). Clones grow ing at the highest concentrations of methotrexate 
are then transferred to new 6-wcll plates containing even higher concentrations 
of methotrexate (1 |iM, 2 *iM, 5 uM, 10 mM, 20 mM). The same procedure 
is repeated until clones are obtained which grow at a concentration of 100 - 
200 mM. Expression of the desired gene product is analyzed, for instance, by 
SDS-PAGE and Western blot or by reversed phase HPLC analysis. 

Example 4: Tissue distribution of TSF receptor mRSA expression 

Northern blot analysis is carried out to examine TNFR-5. -6a or -6(J 
gene expression in human tissues, using methods described by, among others, 
Sambrook ct ut.. cited above. A cDNA probe containing the entire nucleotide 
.sequence of a TNK receptor protein (SEQ ID NO:l, 3 or 5) is labeled with K ? 
using the r«//pnme IM DNA labeling system (Amersham Life Science). 



alcording to manufacturer's instructions. After labeling, the probe is purified 
^iiig a CHROMA SPIN- 100™ column (Clontech Laboratories, Inc.), 
according to manufacturer's protocol number PT12C0-1. The purified labeled 
probe is then used to examine various human tissues for TNF receptor mRNA. 

Multiple Tissue Northern (MTN) blots containing various human 
tissues (H) or human immune system tissues (IM) are obtained from Clontech 
and are examined with the labeled probe using ExprcssHyb™ hybridization 
solution (Clontech) according to manufacturer's protocol number PT1 190-1. 
Following hybridization and washing, the blots are mounted and exposed to 
film at -70° C overnight, and films developed according to standard 
procedures. 

It will be clear that the invention may be practiced otherwise than as 
particularly described in the foregoing description and examples. Numerous 
modifications and variations of the present invention are possible in light of 
the above teachings arwl, theieforc, are within the scope of the appended 
claims. 

The entire disclosure of all publications (including patents, patent 
applications, journal articles, laboratory manuals, books, or other documents) 
cited herein are hereby incorporated by reference. 
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SEQUENCE LISTING 



il) GENERAL- INFORMATION: 

(i) APPLICANT: WEI , YING-FEI 
NI, JIAN 
FAN , PENG 
EBNER. REINHARD 
YU, GUO- LIANG 
GENTZ , REINER 
RUBEN , STEVEN 

(ii) TITLE OF INVENTION: TUMOR NECROSIS FACTOR RECEPTORS 5, 6 

ALPHA AND 6 BETA 



(iii) NUMBER OF SEQUENCES: 38 
(iv> CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: HUMAN GENOME SCIENCES , 

(B) STREET: 9410 KEY WEST AVENUE 

(C) CITY: ROCKVILLE 

(D) STATE: MD 

(E) COUNTRY: US 
t F) ZIP: 20(350 



INC. 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C. OPERATING SYSTEM: PC - DOS /MS - DOS 

<D> SOFTWARE : Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 
(Ai APPLICATION NUMBER : 
•3) FILING DATE: 

(C) CLASSIFICATION: 

/iii* ATTORNEY AGENT INFORMATION: 

•A) NAME: BROOKES , ANDERS A 

;B) REGISTRATION NUMBER : 36,373 

;C) REFERENCE/ DOCKET NUMBER: PF3 41 



rSLECOMMUNICATICN INFORMATION : 

(A) TELEPHONE: (301) 309-8504 

(B) TELEFAX : (301) 309-8512 



■2 ) INFORMATION FOR SEQ ID NO : 1 : 

(il SEQUENCE CHARACTERISTICS: 

• A J LENGTH: 13 52 base pairs 
.3: TYPE: nucleic acid 
(Ci STRAND EDNES S : single 
wD; TOPOLOGY: linear 

(ii. y.l LZCULE TYPE: DNA (genomic) 



/ 



I i ■ ■ 



(XX) FEATVPE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 183 . .959 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 1 : 

CCTCTCCACG CGCACGAACT CAGCCAACGA TTTCTGATAG ATTTTTGGGA GTTTCACCAG 

AGATGCAAGG GGTGAACGAG CGCTTCCTAC CGTTAGGGAA CTCTGGGGAC AGAGCGCCCC 

GGCCGCCTGA T3GCCGAGGC AGGGTGCGAC CCAGGACCCA GGACGGCG7C GGGAACCATA 

CC ATG GCC CGG ATC CCC AAG ACC CTA AAG TT C GTC GTC GTC ATC GTC 
Met Ala Arg He Pro Lys Thr Leu Lys Phe Val Val Val He Val 
! 5 10 15 

GCG GTC CTG CTG CCA GTC CTA GCT TAC TCT GCC ACC ACT GCC CGG CAG 
A* a val Leu Leu Pro Val Leu Ala Tyr Ser Ala Thr Thr Ala Arg Gin 
20 25 30 

GAG GAA GTT CCC CAG C*G ACA GTG GCC CCA CAG CAA CAG AGG CAC AGC 
Glu Glu Val Pro Gin Gin Thr Val Ala Pro Gin Gin Gin Arg His Ser 
35 40 4:5 

TTC AAG GGG GAG GAG TGT CCA GCA GGA TCT CAT AG A TCA GAA CAT ACT 
Phe Lys Gly Glu Glu Cys Pro Ala Gly Ser His Arg Ser Glu His Thr 
50 55 60 

^GA GCC TGT AAC CCG TGC ACA GAG GGT GTG GAT TAC ACC AAC GCT TCC 
Gly A^a Cys Asn Pro Cys Thr Glu Gly Val Asp Tyr Thr Asn Ala Ser 
€5 7 0 75 

AAC AAT GAA ZZT TCT TGC TTC CCA TGT ACA GTT TGT AAA TCA GAT CAA 
Asn Asn Glu Pro Ser Cys Phe Pro Cys Thr Val Cys Lyr Ser Asp G.n 
80 85 90 ^ 

AAA CAT AAA AGT TCC TGC ACC ATG ACC AGA GAC ACA GTG TGT CAG TGT 
t vs His Lys Ser Ser Cys Thr Met Thr Arg Asp Thr /a* C>s G.n c>- 
100 HO 

AAA GAA GGC ACC TTC CGG AAT GAA AAC TCC CCA GAG ATG TGC CGG AAG 
Lys Glu Gly Thr Phe Arg Asn Glu Asn Ser Pro Glu Met Cys Arg ,ys 
115 1=0 125 

TGT AGC AGG TGC CCT AGT GGG GAA GTC CAA GTC AGT AAT TGT ACG TCC 
Cys Ser Arc Cys Pro Ser Gly Glu Val Gin Val Ser Asr. Cys Thr Ser 
130 * 135 «0 

TGC GAT GAT ATC CAG TGT GTT GAA GAA TTT GGT GCC AAT GCC ACT GTG 
Trp Asp Asp lie Gin Cys Val Glu Glu Phe Gly Ala Asn Ala Thr Val 
l4 .5 150 15S 



60 
120 
180 
227 

275 

323 

371 

419 

467 

515 

563 

611 



U *■ 



GS0 



• • • 




.XI 



GAA ACC CCA 
Glu Thr Pro 

160 

GCC CCA GCT 
Ala Pro Ala 



CCA GCT GCT 
Pro Ala Ala 



GCT GCT GAA 
Ala Ale- Glu 
210 

GCT GAA GAG 
Ala Glu Glu 
225 

TAC CTC TCA 
Tyr Leu Ser 
240 

A — T GTG TTT 
He "al Phe 



GCT 
Ala 



GCT 
Ala 



GAA 
Glu 
195 

GAG 
Glu 



ACA 
Thr 



TGC 
Cys 



GTI 
Val 



83 

GCT GAA GAr ACA ATG AAC ACC AGC CCG GGG ACT CCT 

Ala Glu riu Thr Met Asn Thr Ser Pro Gly Thr Pro 

165 I 70 175 

GAA G \G ACA ATG AAC ACC AGC CCA GGG ACT CCT GCC 
Glu Mu Thr Met Asn Thr Ser Pro Gly Thr Pro Ala 
180 190 

GAC ACA ATI ACC ACC AGC CCG GGG ACT CCT GCC CCA 
Glu -nu Met Thr Thr Ser Pro Gly Thr Pro Ala Pro 
200 205 

ACA ATG ACC ACC AGC CCG GGG ACT CCT GCC CCA GCT 
-hr Met Thr Thr Ser Pro Gly Thr Pro Ala Pro Ala 
215 220 

ATG ACC ACC AGC CCG GGG ACT CCT GCC TCT TCT CAT 
Met Thr Thr Ser Pro Gly Thr Pro Ala Ser Ser His 
230 235 

ACC ATC GTA GGG ATC ATA GTT CTA ATT GTG CTT CTG 
Thr He Val Gly He He Val Leu He val Leu Leu 
245 250 255 

TGAAAGACTT CACTGTGGAA GAAATTCCTT CCTTACCTGA 



707 



755 



803 



851 



899 



94 7 



?99 



i v v. 



AAGGTTCAGG TAGGC3CTGG CTGAGGGCGG GGGGCGCTGG AC ACT CT CTG CCCTGCCTCC 
CTCTGCTGTG TTCCCACACA CAGAAACGCC TGCCCCTGCC CCAAGTCCTG GTGTCTCCAG 
CCTGGC7CTA TC _ CCTCCT TGTGATCGTC CCATCCCCAC ATCCCGTGCA CCCCCCAGGA 
crcTaGTCTC ATCA3TCC;rT CTC CTGG AG C TGGGGGTCCA CACATCTCCC AGCCAAGTCC 
AA3AGGCAGG G C C AG TT C CT CCCATCTTCA GGCCCAGCCA GGCAGGGGGC AGTCGCCTCC 
7CAACTG3GT GACAAGGGTG AG G ATG AG AA GTGGTCACGG GATTTATTCA GCCTTGGTCA 
GA3CAGAACA CAGAGATTTT CCGTGAAAAA AAA 



1059 
1119 
1179 
1239 
1299 
1359 
1392 



; 2 1 INFORMATION FOR SEQ ID NO : 2 : 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 
(B'< TYPE: ammo acid 
(D) TOPOLOGY: linear 



MCLECVLE TYPE: prctem 
lx;.! SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



t . - j-— — - 



1 



'** "J 



✓» _ V 



TV 

& 



V. 

- ( 
t r 
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Met Ala Arg lie Pro Lya Thr Leu Lys Phe Val Val Val lie Val Ala 
! 5 10 15 

val Leu Leu Pro Val Leu Ala Tyr Ser Ala Thr Thr Ala Arg Gin Glu 



20 " 

Glu Val Pro Gin Gin Thr Val Ala Pro Gin Gin Gin Arg His Ser Phe 

40 4S 



35 



Lys Gly Glu Glu Cys Pro Ala Gly Ser 

55 60 



His Arg Ser Glu His Thr Gly 



SO 

Ala Cys 
65 



Asn Pro Cys Thr Glu Gly Val Asp Tyr Thr Asn Ala Ser Asn 

75 80 



70 



Asn Glu Pro Ser Cys Phe Pro Cys Thr Val Cys Lys Ser Asp Gin Lys 
85 90 9* 



His Lys Ser Ser Cys 
100 



Thr Met Thr Arg Asp Thr Val Cys Gin Cys Lys 



105 



110 



Glu Gly Thr Phe Arg Asn Glu Asn Ser Pro Glu Met Cys Arg Lys Cys 
115 I- 20 125 

S-r Arq Cys Pro Ser Gly Glu Val Gin Val Ser Asn Cys Thr Ser Trp 

135 140 



130 



Asp Asp He Gin Cy- Val 

14 5 150 155 



Glu Glu Phe Gly Ala Asn Ala Thr Val Glu 

160 



Thr Pro Ala Ala Glu Glu Thr Met Asn Thr Ser Pro Gly Thr Fro Ala 
165 l^O i7S 

F-c Ala Ala Glu Glu Thr Met Asn Thr Ser Pro Gly Thr Pro Ala Pro 
180 185 190 

Ala Ala Glu Glu Thr Met Thr Thr Ser Pro Gly Thr Pro Ala Pro Ala 
195 200 205 

Ala Glu Glu Thr Met Thr Thr Ser Pro Gly Thr Pro Ala Pro Ala Ala 
210 215 220 

Glu Glu Thr Met Thr Thr Ser Pro Gly Thr Pro Ala Ser Ser His Tyr 
225 230 235 240 

Gly He He Val Leu He Val Leu Leu He 



Leu Ser Cvs Thr lie Val Gly He He Val 
245 250 

Val Phe Val 

(2) INFORMATION FOR SEQ ID NO : 3 : 

' ■ ( i ) * SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1077 base pairs 



255 



Kg**" 

i 
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• 
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(B) TYPE: nucleic acid 
tC) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 2S . . 924 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

G" CTCCCTG CTCCAGCAAG GACC ATG AGG GCG CTG GAG GGG CCA GGC CTG 

Met Arg Ala Leu Glu Gly Pro Gly Leu 
1 5 



TCG CTG CTG TGC CTG GTG TTG GCG CTG CCT GCC CTG CTG CCG GTG CCG 
S«- Leu Leu Cys Leu Val Leu Ala Leu Pro Ala Leu Leu Pro Val Pro 
10 15 20 " 

GCT GTA CGC GGA GTG GCA GAA ACA CCC ACC TAC CCC TGG CGG GAC GCA 
Ala Val Arg Gly Val Ala Glu Thr Pro Thr Tyr Pro Trp Arg Asp Ala 
30 3S 40 

GAG ACA GGG GAG CGG CTG GTG TGC GCC CAG TGC CCC CCA GGC ACC TTT 
Glu Thr Gly Glu Arg Leu Val Cys Ala Gin Cys Pro Pro Gly Thr Phe 
45 50 55 

GTG CAG CGG CCG TGC CGC CGA GAC AGC CCC ACG ACG TGT GGC CCG TGT 
Val Gin Arg Pro Cys Arg Arg Asp Ser Pro Thr Thr Cys Gly Pro Cys 
60 65 ™ 

C-A CCG CGC CAC TAC ACG CAG TTC TGG AAC TAC CTG GAG CGC TGC CGC 
?-o Pro Arg His Tvr Thr Gin Phe Trp Asn Tyr Leu Glu Arg Cys Arg 
75 80 65 

-A- TGC AAC GTC CTC TGC GGG GAG CGT GAG GAG GAG GCA CGG GCT TGC 
C--s As- Val Leu Cys Gly Glu Arg Glu Glu Glu Ala Arg Ala Cys 
*' 90 ' " 95 100 105 

CAC GCC ACC CAC AAC CGT GCC TGC CGC TGC CGC ACC GGC TTC TTC GCG 
His Ala Thr His Asn Arg Ala Cys Arg Cys Arg Thr Gly Phe Phe Ala 
110 US :2 ° 

CAC GCT GGT TTC TGC TTG GAG CAC GCA TCG TGT CCA CCT GGT GCC GGC 
His Ala Gly Phe Cys Leu Glu His Ala Ser Cys Pro Pro Gly Ala Gly 
125 130 135 

GTG ATT GCC CCG GGC ACC CCC AGC CAG AAC ACG CAG TGC CAG CCG TGC 
Val lie Ala Pro Gly Thr Pro Ser Gin Asn Thr Gin Cys Gin Pro Cys 
140 145 150 




86 

CCC CCA GGC ACC TTC TCA GCC AGC AGC TCC AGC TCA GAG CAG TGC CAG 
Pro Pro Gly Thr Phe Ser Ala Ser Ser Ser Ser Ser Glu Gin Cya Gin 
I55 160 165 

CCC CAC CSC AAC TGC ACG GCC CTG GGC CTG GCC CTC AAT GTG CCA GGC 
Pro His Arg Asn Cys Thr Ala Leu Gly Leu Ala Leu Asn Val Pro Gly 
170 180 "5 

TCT TCC TCC CAT GAC ACC CTG TGC ACC AGC TGC ACT GGC TT0 CCC CTC 
iTr Ser Ser His Asp Thr Leu Cys Thr Ser Cys Thr Gly Ph. Pro Leu 
190 200 

AGC ACC AGG GTA CCA GGA GCT GA 3 GAG TGT GAG CGT GCC GTC ATC GAC 
Se- Thr Arg Val Pro Gly Ala Glu Glu Cys Glu Arg Ala Val He Asp 
205 210 215 

TTT GTG GCT TTC CAG GAC ATC TCC ATC AA^G AGG CTG CAG CGG CTG CTG 
Phe Val Ala Phe Gin Asp lie Ser He Lys Arg Leu Gin Arg Leu Leu 
220 225 230 

CAG GCC CTC GAG GCC CCG GAG GGC TGG GGT CCG ACA CCA AGG GCG GGC 
Gin Ala Leu Glu Ala Pro Glu Gly Tr P Gly Pro Thr Pro Arg Ala Gly 
235 240 245 

CSC GCG GCC TTG CAG CTG AAG CTG CGT CGG CGG CTC ACG GAG CTC CTG 
l~l Ala Ala Leu Gin Leu Lys Leu Arg Arg Arg Leu Thr Glu Leu Leu 
250 255 260 265 

GGG GCG CAG GAC GGG GCG CTG CTG GTG CGG CTG CTG CAG GCG CTG CGC 
Gly Ala Gin Asp Gly Ala Leu Leu Val Arg Leu Leu Gin Ala Leu Arg 
270 275 280 

G-G GCC AGG ATG CCC GGG CTG GAG CGG AGC GTC CGT GAG CGC TTC CTC 
val Ala Arg Met Pro Gly Leu Glu Arg Ser Val Arg Glu Arg Phe Leu 
285 290 295 

CCT GTG CAC TGATCCTGGC CCCCTCTTAT TTATTCTACA TCCTTGGCAC 
Pro Val His 
3 00 

CCCACTTGCA CTGAAAGAGG CTTTTTTTTA AATAGAAGAA ATGAGGTTTC TTAAAGCTTA 
TTTTTATAAA GCTTTTTCAT AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAA 

(2) INFORMATION FOR SEQ ID NO : 4 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

di) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



531 

579 

627 

675 

723 

771 

819 

867 

915 

964 

1024 
1077 



5^ 
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u Arg Ala Leu Glu Gly Pro Gly Leu Ser Leu Leu Cys Leu Val Leu 
! 5 1° 

Ala Leu 

-h~ Tyr Pro Trp Arg Asp Ala Glu Thr Gly Glu Arg Leu Val 
35 <° 45 

cys Ala Gin Cys Pro Pro Gly Thr Phe Val Gin Arg Pro Cys Arg Arg 

50 55 6° 

Asp Ser Pro Thr Thr Cys Gly Pro Cys Pro Pro Arg His Tyr Thr Gin 



Met mi y ~~ * ' j^g 

Pro Ala Leu Leu Pro Val Pro Ala Val Arg Gly Val Ala Glu 
20 25 30 

Thr Pro Thr 

35 40 45 



65 



70 



75 



Phe Trp Asn Tyr Leu Glu Arg Cys Arg Tyr Cys Asn Val Leu Cys Gly 



85 



90 



Glu Arg Glu Glu Glu Ala Arg Ala Cys His Ala Thr His Asn Arg Ala 
,00 105 

Cys Arg Cys Arg Thr Gly Phe Phe Ala His Ala Gly Phe Cys Leu Glu 
115 120 1 



His Ala Ser Cys Pro Pro Gly Ala Gly Val He Ala Pro Gly Thr Pro 
130 135 140 

Ser Gin Asn Thr Gin Cys Gin Pro Cys Pro Pro Gly Thr Phe Ser Ala 
>r Ser Ser Ser Glu Gin Cys Gin Pro His Arg Asn Cys Thr Ala 



Ser Sei 



165 



170 



175 



Leu Gly Leu Ala Leu Asn Val Pro Gly Ser Ser S^r His Asp Thr Leu 
180 



185 19° 



r ys Thr Ser Cys Thr Gly Phe Pro Leu Ser Thr Arg Val Pro Gly Ala 

195 200 205 

Glu Cvs Glu Arg Ala Val He Asp Phe Val Ala Phe C-ln Asp lie 

^ 210 * 215 220 



Ser He Lys Arg Leu Gin Arg Leu Leu 



Gin Ala Leu C-lu Ala Pro Glu 



22S 230 235 2 *0 

Gly Trp Gly Pro Thr Pro Arg Ala Gly Arg Ala Ala Leu Gin Leu Lys 
245 250 255 

Leu Arg Arg Arg Leu Thr Glu Leu Leu Gly Ala Gin Asp Gly Ala Leu 
260 265 270 

Leu Val Arg Leu Leu Gin Ala Leu Arg Val Ala Arg Met Pro Gly Leu 
275 280 285 
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Glu Arg Ser Val Arg Glu Arg Phe Leu Pro Val His 
290 295 300 

(2) INFORMATION FCR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1667 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 73.. 582 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TGGCATGTCG GTCAGGCACA GCAGGGTCCT GTGTCCGCGC TGAGCCGCGC TCTCCCTGCT 

C-AG^AAGGA CC ATG AGG GCG CTG GAG GGG CCA GGC CTG TCG CTG CTG 
Met Arg Ala Leu Glu Gly Pro Gly Leu Ser Leu Leu 
1 S 10 

TGC CTG GTG TTG GCC CTG CCT GCC CTG CTG CCG GTG CCG GCT GTA CGC 
Cys Leu Val Leu Ala Leu Pro Ala Leu Leu Pro Val Pro Ala Val Arg 
15 20 25 

GGA GTG GCA GAA ACA CCC ACC TAC CCC TGG CGG OAC GCA C^G ACA GGG 
Glv Val Ala Glu Thr Pro Thr Tyr Pro Trp Arg Asp Ala Glu Thr Gly 
" 30 35 40 

GAG CGG CTG GTG TGC GCC CAG TGC CCC CCA GGC ACC TTT GTG CAG CGG 
G'u Arg Leu Val Cys Ala Gin Cys Pro Pro Gly Thr Phe Val Gin Arg 
Is SO 55 60 

—3 TGC CGC CGA GAC AGC CCC ACG ACG TGT GGC CCG TGT CCA CCG CGC 
P-c Cys Arg Arg Asp Ser Pro Thr Thr Cys Gly Pro Cys Pro Pro Arg 
65 7 0 75 

CAC TAC ACG CAG TTC TGG AAC TAC CTG GAG CGC TGC CGC TAC TGC AAC 
His Tyr Thr Gin Phe Trp Asn Tyr Leu Glu Arg Cys Arg Tyr Cys Asn 
60 85 90 

GTC CTC TGC GGG ~AG CGT GAG GAG GAG GCA CGG GCT TGC CAC GCC ACC 
Val Leu Cys Gly G a m Arg Glu Glu Glu Ala Arg Ala Cys His Ala Thr 
95 100 105 

CAC AAC CGT GCC TGC CGC TGC CGC ACC GGC TTC TTC GCG CAC GCT GOT 
H s Asn Arg Ala Cys Arg Cys Arg Thr Gly Phe Pha Ala His Ala Gly 
i-O ' H5 120 
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TTC TGC TTG GAG CAC GCA TCG TGT CCA CCT GGT 
Phe Cys Leu Glu His Ala Ser Cys Pro Pro Gly 
125 «° " 5 

CCG GGT GAG AGC TGG GCG AGG GGA GGG GCC CCC 
Pro Gly Glu Ser Trp Ala Arg Gly Gly Ala Pro 
145 150 

AGG TGT GGC AGG GGT CAG GTT GCT GGT CCC AGC 
Arg Cys Gly Arg Gly Gin val Ala Gly Pro Ser 
160 165 



GCC GGC GTG ATT GCC 
Ala Gly Val He Ala 
140 

AGG AGT GGT GGC CGG 
Arg ser Gly Gly Arg 
155 

CTT GCA CCC 
Leu Ala Pro 
170 



TG A GCT AGG A CACCAGTT CC CCTGACCCTG TTCTTCCCTC 
CCAGAACACG CAGTGCCAGC CGTGCCCCCC AGGCACCTTC 
AGAGCAGTGC CACJJCCACC GCAACTGCAC GGCCCTGGGC 
CTCTTCCTCC CATGACACCC TGTGCACCAG CTGCACTGGC 
ACCAGGTGAG C CAG AGG CCT GAGGGGGCAG CACACTGCAG 
CACTCCTGCC CCTGCACGTG CATCTAGCCT GAGGCATGCC 
CACAGTGGAT TTGAGGGGTC AGGGGTCCCT CCACTAGATC 
AG3GGTGGCT GAGAATTT GG ATCTGAGCCA GGGCACAGCC 
AAGTGGGCAG CAATCTCCTA ACTGCCCGAG GGGAAGGTGG 
AAACCGAGGC CTGATGGTAA CTCTCCTAAC TGCCTGAGAG 
ACA7GGGGAA ACCGAG3CCC AATGTTAACC ACTGTTGAGA 
C Z CTTAACAT CAAGTCAGGT CCGGTCCATC TGCAGGTCCC 
CdASGAGCCC CAAGCCCTTG CCTGGGCCCC CTTGCCTCTT 
CGCTCCTGCC CCCTAGGCCT TTGCTCCAGC TCTCTGACCG 
AGTCCCCATC GTTGCACTGC CCTCTCCAGC ACGGCTCACT 
TGCAAACCCC CCGAGTGGGG CCCAGAAAGC AGGGTA r CTG 
TGGGTGAAAT GATCGGACCG CTGCCTCCCC ACCCCACTGC 
GT3CCGTCAT CGACTTTGTG GCTTTCCAGG ACATCTCCAT 
GGCCC 



CTGGCTGCAG GCACCCCCAG 
TCAGCCAGCA GCTCCAGCTC 
CTGGCCCTCA ATGTGCCAGG 
TTCCCCCTCA GCACCAGGGT 
GCCAGGCCCA CTTGTGCCCT 
AGCTGGCTCT GGGAAGGGGC 
CCCACCAAGT CTGCCCTCTC 
TCCCCTGGAG AGCTCTGGGA 
CTGGCTCCTC TGACACGGGG 
GAAGGTGGCT GCCTCCTCTG 
AGTCACAGGG GGAAGTGACC 
AACTCGCCCC TTCCGATGGC 
GCAGCCAAGG TCCGAGTGGC 
AAGGCTCCTG CCCCTTCTCC 
GCACAGG^AT TTCTCTCTCC 
GCAGCCCCCG CCAGTGTGTG 
AGGAGCTGAG G AGTGTG AGC 
CAAGAGGAGC GGCTGCTGCA 



492 

540 

582 

€42 
702 
762 
822 
882 
942 
1002 
1062 
1122 
1182 
1242 
1302 
1362 
1422 
1482 
1542 
1602 
1662 
1667 



INFO FIXATION FOR SEQ ID NO : 6 : 

'" i; SEQUENCE CHARACTERISTICS : 

(A/ LENGTH: 170 ammo acids 
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(B) TYPE : amino acid 
(D) TOPOLOGY ; linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Arg Ala Leu Giu Gly Pro Gly Leu Ser Leu Leu Cys Leu Val Leu 
15 10 15 

Ala Leu Pro Ala Leu Leu Pro Val Pro A*a Val Arg Gly Val Ala Gin 
ZO 25 30 

Thr Pro Thr Tyr Pro Trp Arg Asp Ala Glu Thr Gly Glu Arg Leu Val 
35 40 45 

Cys Ala Gin Cys Pro Pro Gly Thr Phe Val Gin Arg Pro Cys Arg Arg 
50 SS 60 

Asp Ser Pro Thr Thr Cys Gly Pro Cys Pro Pro Arg His Tyr Thr Gin 
65 70 75 80 

Phe Trp Asn Tyr Leu Glu Arg Cys Arg Tyr Cys Asn Val Leu Cys Gly 
8S 90 95 

Glu Arg Glu Glu Glu Ala Arg Ala Cys His Ala Thr His Asn Arg Ala 
100 105 HO 

Cys Arg Cys Arg Thr Gly Phe Phe Ala His Ala Gly Phe Cys Leu Glu 
115 120 125 

His Ala Ser Cys Pro Pro Gly Ala Gly Val He Ala Pro Gly Glu Ser 
130 135 140 

Trp Ala Arg Gly Gly Ala Pro Arg Ser Gly Gly Arg Arg Cys Gly Arg 
145 150 155 160 

Gly Gin Val Ala Gly Pro Ser Leu Ala Pro 
165 170 



52 ) INFORMATION FCR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLoGY: linear 

(ii) MOLECULE TYPE: protein 



SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
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Mtt Gly Leu Ser Thr Val Pro Asp Leu 
X 5 10 1S 



Leu Leu Pro Leu Val Leu Leu 
10 1S 

Glu Leu Leu Val Gly lie Tyr Pro Ser Gly Val lie Gly Leu Val Pro 
20 =5 30 

His Leu Gly Asp Arg Glu Ly3 Arg Asp Ser Val Cys Pro Gin Gly Lys 
35 40 4S 

Tyr lie His Pro Gin Asn Asn Ser II* Cys Cys Thr Lys Cys His Lys 
50 55 60 

Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gin Asp Thr Asp 

Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu 
85 90 95 

Arg His Cys Leu Ser Cys Ser Lys Cyr Arg Lys Glu Met Gly Gin Val 
100 105 11° 

Glu lie Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys Arg 
115 120 125 

Lys Asn Gin T/r Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys Phe 
130 I" 140 

Asn Cys Ser Leu Cys Leu Asn Gly Thr Val His Leu Ser Cys Gin Glu 
150 155 160 



145 

Lys Gin Asn Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu 
165 17 0 I 75 

Asn Glu Cys Val Ser Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr 
180 185 190 

Lys Leu Cvs Leu Pro Gin He Glu Asn Val Lys Gly Thr Glu Asp Ser 
195 200 205 

Gly Thr Thr Val Leu Leu Pro Leu Val He Phe Phe Gly Leu Cys Leu 
210 215 220 

Leu Ser Leu Leu Phe He Gly Leu Met Tyr Arg Tyr Gin Arg Trp Lys 
22* 230 235 240 

Ser Lys Leu Tyr Ser lie Val Cys Gly Lys Ser Thr Pro Glu Lys Glu 
245 250 255 

Gly Glu Leu Glu Gly Thr Thr Thr Lys Pro Leu Ala Pro Asn Pro Ser 
260 265 270 

Phe Ser Pro Thr Pro Gly Phe Thr Pro Thr Leu Gly Phe Ser Pro Val 
. * 275 280 "85 



"""" J siwJ*^ 



2££ 



*>5 



o 
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Pro Ser Ser Thr Phe Thr Ser Ser Ser Thr Tyr Thr Pro Gly Asp Cyg 
290 295 300 

Pro Asn Phe Ala Ala Pro Arg Arg Glu Val Ala Pro Pro Tyr Gin Gly 
305 310 315 320 

Ala Asp Pro lie Leu Ala Thr Ala Leu Ala Ser Asp Pro He Pro Asn 
325 330 335 

Pro Leu Gin Lys Trp Glu Asp Ser Ala His Lys Pro Gin Ser Leu Asp 

345 350 



340 



Thr Asp Asp P 
35S 



ro Ala Thr Leu Tyr Ala Val Val Glu Asi Val Pro Pro 



360 



365 



Leu Arg Trp Lys Glu Phe Val Arg Arg Leu Gly Leu Ser Asp His Glu 

370 375 380 

He Asp Arg Leu Glu Leu Gin Asn Gly Arg Cys Leu Arg Glu Ala Gin 

385 390 395 400 



Tyr Ser Met Leu Ala Thr Trp Arg Arg *rg Thr Pro Arg Arg Glu Ala 
405 410 41S 

Th- Leu Glu Leu Leu Gly Arg Val Leu Arg Asp Met Asp Leu Leu Gly 
420 425 430 

Cys Leu Glu Asp He Glu Giu Ala Leu Cys Gly Pro Ala Ala Leu Pro 
435 440 445 



wm 



r « * - *- 



r. .S 



Pro Ala Se: Leu Leu Arg 

4=3 455 



INFOPJ^ATICN FOR SEQ ID NO : 8 : 



{ i SEC VENCE CHARACTER I ST I CS : 

.A) LENGTH: 461 amino acids 
(3) TYPE: anino acid 
, C> STRANDEDNESS : single 
D) TOPOLOGY: linear 



f _ > MOLECULE TYPE: protein 



'xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



^ t J* I ,. 



Met Ala Pro Val Ala Val Trp Ala Ala Leu Ala Val Gly Leu Glu Leu 
15 io 15 



Trp Ala Ala Ala His Ala Leu Pro Ala Gin Val Al* Phe Thr Pro Tyr 



20 



25 



30 



94 

He Thr Ala Pro Ser Ser Ser Ser Ser Ser Leu Glu Ser Ser Ala Ser 
325 330 335 

Ala Leu Asp Arg Arg Ala Pro Thr Arg Asn Gin Pro Gin Ala Pro Gly 
340 345 350 

Val Glu Ala Ser Gly Ala Gly Glu Ala Arg Ala Ser Thr Gly Ser Ser 
355 360 365 



1 Asn Val Thr Cy3 He 
380 

His Ser Ser Gin Cys Ser Ser Gin 

395 400 

Ala Ser Ser Thr Met Gly Asp Thr Asp Ser Ser Pro Ser Glu Ser Pro 



Asp Ser Ser Pro Gly Gly His Gly Thr Gin Va 

370 375 *** 

Val Asn Val Cys Ser Ser Ser Asp 

385 390 395 400 



405 



410 415 



Lv 3 Aso Glu Gin Val Pro Phe Ser Lys Glu Glu Cys Ala Phe Arg Ser 

420 425 430 

Gin Leu Glu Thr Pro Glu Thr Lou Leu Gly Ser Thr Glu Glu Lys Pro 
435 440 445 



Leu Pro Leu Gly Val Pro Asp Ala Gly Met Lys Pro Ser 
450 



455 460 



INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: 3EQ ID NO : 9 : 

Met Gly Ala Gly Ala Thr Gly Arg Ala Met Asp Gly Pro Arg Leu Leu 
1 5 10 15 

Leu Leu Leu Leu Leu Gly Val Ser Leu Gly Gly Ala Lys Glu Ala Cys 
20 25 30 

Pro Thr Gly Leu Tyr Thr His Ser Gly Glu Cys Cys Lys Ala Cys Asn 
35 40 45 

Leu Gly Glu Gly Val Ala Gin Pro Cys Gly Ala Asn Gin Thr Val Cys 
50 55 60 




'•ft 

■ ^ 



If 

"tea 



0 



96 



Gly 


Ser 


Ala 
355 


Gly 


Asp 


Thr 


Trp 


Arg 
360 


His 


Leu 


Ala 


Gly 


Glu 
365 


Leu 


Gly Tyr 


Gin 


Pro 
3~0 


Glu 


His 


He 


Asp 


Ser 
375 


Phe 


Thr 


His 


Glu 


Ala 
380 


Cys 


Pro 


Val 


Arg 


Ala 
385 


Leu 


Leu 


Ala 


Ser 


Trp 
39C 


Ala 


Thr 


Gin 


Asp 


Ser 
395 


Ala 


Thr 


Leu 


Asp 


Ala 
400 


Leu 


Leu 


Ala 


Ala 


Leu 
405 


Arg 


Arg 


He 


Gir» 


Arg 
410 


Ala 


Asp 


Leu 


Val 


Glu 
415 


Ser 


Leu 


Cys 


Ser 


Glu 
420 


Ser 


Thr 


Ala 


Thr 


Ser 
425 


Pro 


Val 













iZ) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



f 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Arg Leu Pro Arg Ala Ser Ser Pro Cys Gly Leu Ala Trp Gly Pro 



10 



Leu Leu Leu Gly Leu Ser Gly Leu Leu Val Ala Ser Gin Pro Gin Leu 
20 25 30 

val ?ro Pro Tyr Arg He Glu Asn Gin Thr Cys Trp Asp Gin * ^p Lys 

40 45 

Glu Tvr Tvr Glu Pro Met His Asp Val Cys Cys Ser Arg Cys Pro Pro 
SO * 55 60 

Gly Glu Phe Val Phe Ala Val Cys Ser Arg Ser Gin Asp Thr Val Cys 
65 70 75 80 

Lys Thr Cys Pro His Asn Ser Tyr Asn Glu His Trp Asn His Leu Ser 
95 90 95 

Thr Cys Gin Leu Cys Arg Pro Cys Asp He Val Leu Gly Phe Glu Glu 
100 105 HO 

Val Ala Pro Cys Thr Ser Asp Arg Lys Ala Glu Cys Arg Cys Gin Pro 
US 120 125 



97 

Gly Met Ser Cys Val Tyr Leu Asp Asn Glu Cys Val His Cys Glu Glu 
130 "5 "0 

Glu M 3 Leu val Leu Cys Gin Pro Gly Thr Glu Ala Glu Val Thr Asp 
14S ISO 155 l r '° 

Glu He Met Asp Thr Asp Val Asn Cys Val Pro Cys Lys Pro Gly His 
165 "0 1" 

Phe Gin Asn Thr Ser Ser Pro Arg Ala Arg Cys Gin Pro His Thr Arg 
180 185 I 90 

Cys 31u lie Gin Gly Leu Val Glu Ala Ala Pro Gly Thr Ser Tyr Ser 



195 200 



Asp Thr He Cys Lys A^n Pro Pro Glu Pro Gly Ala Met Leu Leu Leu 

210 215 2=° 

Ala Leu Leu Ser Leu Val Leu Phe Leu Leu Phe Thr Thr Val Leu 

22S ' 230 235 240 

Ala Cvs Ala Trp Met Arg His Pro Ser Leu Cys Arg Lys Leu Gly Thr 

245 250 255 

Glu Gly Glu Glu Ser Pro Pro Cys Pro Ala 



Leu Leu Lys Arg His Pro Glu Gly 
2S0 



265 270 



Pro Arg Ala Asp Pro His Phe Pro Asp Leu Ala Glu Pro Leu Leu Pro 
275 280 285 



Met Ser Gly Asp Leu Ser Pro Se 



r Pro Ala Gly Pro Pro Thr Ala Pro 



190 295 

Se: 



300 



- -eu Glu Glu Val Val Leu Gin Gin Gin Ser Pro Leu Val Gin Ala 
30 ; " 310 315 320 

Arg Glu Leu Glu Ala Glu Pro Gly Glu His Gly Gin Val Ala His Gly 



325 



330 



335 



Ala ;sn Gly He His Val Thr Gly Gly Ser Val Thr Val Thr Gly Asn 
340 345 350 

lie Tyr He Tyr Asn Gly Pro Val Leu Gly Gly Thr Arg Gly Pro Gly 



355 360 



365 



31u Pro Pro Tyr Pro Thr Pro Glu Glu Gly 



Asp Pro Pro Ala Pro Fro Gl 

370 375 380 



Ala Pro Gly Pro Ser Glu Leu Ser Thr Pro Tyr Gin Glu Asp Gly Lys 
385 390 



395 400 



Ala Trp His Leu Ala Glu Thr Glu Thr Leu Gly Cys Gin Asp Leu 
40S 410 415 



) I N r C P.MA7 ION FOR SEQ 10 NO : 1 1 : 



98 



ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

t ii > MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

Met Leu Gly He Trp Thr Leu Leu Pro Leu Val Leu Thr Ser Val Ala 
15 10 15 

Arg Leu Ser Ser Lys Ser Val Asn Ala Gin Val Thr Asp He Asn Ser 
20 25 30 

Lys Gly Leu Glu Leu Arg Lys Thr Val Thr Thr Val Glu Thr Gin Asn 



Leu Glu Gly Leu His His Asp Gly Gin Phe Cys His Lys Pro Cys Pro 
50 55 60 

Pro Gly Glu Arg Lys Ala Arg Asp Cys Thr Val Asn Gly Asp Glu Pro 
65 70 75 80 

Asp Cys Val Pro Cys Gin Glu Gly Lys Glu Tyr Thr Asp Lys Ala His 
85 90 95 

Phe Ser Ser Lys Cys Arg Arg Cys Arg Leu Cys Asp Glu Gly -is Gly 
100 105 HO 

Leu Glu Val Glu He Asn Cys Thr Ai-g Thr Gin Asn Thr Lys Cys Arg 
115 120 125 

Cys Lvs Pro Asn Phe Phe Cys Asn Ser Thr Val Cys Glu His Cys Asp 
130 135 140 

Pro Cys Thr Lys Cys Glu His Gly He He Lys Glu Cys Thr Leu Thr 

155 160 



145 



150 



Ser Asn Thr Lys Cys Lys Glu Glu Gly Ser Arg Ser Asn Leu Gly Trp 
165 1 ? 0 I 75 

Leu Cys Leu Leu Leu Leu Pro He Pro Leu He Val Trp Val Lys Arg 
180 185 190 

Lys Glu Val Gin Lys Thr Cys Arg Lys His Arg Lys Glu Asn Gin Gly 
195 200 205 



Ser His Glu Ser Pro Thr Leu Asn Pro Glu Thr Val Ala He Asn Leu 
210 215 220 





L" * 




I 

L 



A- 



100 



He Thr Ala Aim Ala Glu Cys Ala Cys Arg Asn Gly Trp Glh Cys 



105 



110 



Arg Asp Lys Glu Cys Thr Glu Cys Asp Pro Leu Pro Asn Pro Ser Leu 

120 125 



Thr Ala Arg Ser Ser Gin Ala Leu Ser Pro His Pro Gin Pro Thr His 

135 I 40 



L^u Pro Tyr Val Se 



r Glu Met Leu Glu Ala Arg Thr Ala Gly His Met 



ISO 



155 



160 



Gin Leu Pro Ala Arg Thr Leu Ser Thr 



170 



175 



Gin Thr Leu Ala Asp Phe Arg 
165 

His Trp Pro Pro Gin Arg Ser Leu Cys Ser Ser Asp Phe lie Arg He 
180 185 

Leu Val He Phe Ser Gly Met Phe Leu Val Phe Thr Leu Ala Gly Ala 

195 205 
Leu Phe Leu His Gin Arg Arg Lys Tyr Arg Ser Asn Lys Gly Glu Ser 



210 



215 



220 



Pro 

225 



val Glu Pro Ala Glu Pro Cys Arg Tyr Ser Cys Pro Arg Glu Glu 

235 240 



230 



Glu Gly Ser Thr lie Pro lie Gin Glu Asp Tyr Arg Lys Pro Glu Pro 
24S 250 255 

Ala Cys Ser Pro 
260 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 595 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS: single 
iO) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 



txi) SEQUENCE DESCRIPTION: SLQ ID NO: 13: 

Met Arg Val Leu Leu Ala Ala Leu Gly Leu Leu Phe Leu Gly Ala Leu 



10 



15 



Arq Ala Phe Pro Gin Asp Arg Pro Phe Glu Asp Thr Cys His Gly Asn 
20 25 30 



-. 

c. * "Si 



. 4 



r 

L 





Pro Ser Hia Tyr Tyr 
35 

Pro Met Gly Leu Phe 
50 

Cys Arg Lys Gin Cys 
65 

Cys Thr Ala Cys Val 
85 

Pro Cys Ala Trp Asn 
100 

Phe Cys Ser Thr Ser 
115 

Ser Val Cys Pro Ala 
130 

Lys Asn Thr Val Cys 
145 

Ala Ser Pro Glu Asn 
165 

Ala Lys Pro Thr Pro 
180 

Pro Val Arg Gly Gly 
195 

Thr Arg Ala Pro Asp 
210 

Pro Gly Leu Ser Pro 
225 

Arg Lys Gin Cys Glu 
245 

Thr Ala Cys Val Ser 

260 

Cys Ala Trp Asn Ser 
275 

Cys Ala Thr Ser Ala 
290 



101 

Asp Lys Ala Val Arg Arg 
40 

Pro Thr Gin Gin Cys Pro 
55 

Glu Pro Asp Tyr Tyr Leu 
70 75 

Thr Cys Ser Arg Asp Asp 
90 

Ser Ser Arg Val Cys Glu 
105 

Ala Val Asn Ser Cy9 Ala 
120 

Gly Met He Val Lys Phe 
13S 

Glu Pro Ala Ser Pro Gly 
150 155 

Cys Lys Glu Pro Ser Ser 
170 

Val Ser Pro Ala Thr Ser 
185 

Thr Arg Leu Ala Gin Glu 

200 

Ser Pro Ser Ser Val Gly 
215 

Thr Gin Pro Cys Pro Glu 
230 235 

Pro Asp T> r Tyr Leu Asp 
250 

Cys Ser Arg Asp Asp Leu 
265 

Ser Arg Thr Cys Glu Cys 
280 

Thr Asn Ser Cys Ala Arg 
295 



Cys Cys Tyr Arg Cys 
45 

Gin Arg Pro Thr Asp 
60 

Asp Glu Ala Asp Arg 
80 

Leu Val Glu Lys Thr 
95 

Cys Arg Pro Gly Met 
110 

Arg Cys Phe Phe His 
125 

Pro Gly Thr Ala Gin 
140 

Val Ser Pro Ala Cys 
160 

Gly Thr He Pro Gin 
175 

Ser Ala Ser Thr Met 
190 

Ala Ala Ser Lys Leu 
205 

Arg Fro Ser Ser Asp 
220 

Gly Ser Gly Asp Cys 
24C 

Glu Ala Gly Arg Cys 
255 

Val Glu Lys Thr Pro 
270 

Arg Pro Gly Met He 
285 

Cys Val Pro Tyr Pro 
300 



He Cys Ala Ala Glu Thr Val Thr Lys Pro Gin Asp Met Ala Glu Lys 
305 . 310 315 320 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH. 277 amino acids 

(B) TYPE: amino acid 

(C) STRAOTEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE LESCRIPTION: SEQ ID NO: 14: 

Met Val Arg Leu Pro Leu Gin Cys Val Leu Trp Gly Cys Leu Leu Thr 
1 5 10 1S 

Ala Val His Pro Glu Pro Pro Thr Ala Cys Arg Glu Lys Gin Tyr Leu 
20 25 30 



lie Asn Ser Gin Cys Cys Ser Leu Cys Gin Pro Gly Gin Lys Leu Val 
35 40 45 

Ser Asp Cys Thr Glu Phe Thr Glu Thr Glu Cys Leu Pro Cys Gly Glu 
50 55 6* 

Ser Glu Phe Leu Asp Thr Trp Asn Arg Glu Thr His Cys His Gin His 
65 70 75 80 

Lys Tyr cys Asp Pro Asn Leu Gly Leu Arg Val Gin Gin Lys Gly Thr 
85 9° 95 

Ser GVj Thr Asp Thr He Cys Thr Cys Glu Glu Gly Trp His Cys Thr 

105 HO 



100 



Ser Glu Ala Cys Glu Ser Cys 

115 120 



Val Leu His Arg Ser Cys Ser Pro Gly 
125 



Phe Glv Val Lys Gin lie Ala Thr Gly Val Ser Asp Thr He Cys Glu 
130 135 140 

P-o Cys Pro Val Gly Phe Phe Ser Asn Val Ser Ser Ala Phe Glu Lys 

1« 150 .155 160 

Cys His Pro Trp Thr Ser Cys Glu Thr Lys Asp Leu Val Val Gin Gin 
165 1 7 0 175 

Ala Gly Thr Asn Lys Thr Asp Val Val Cys Gly Pro Gin Asp Arg Leu 
180 185 190 

Arg Ala Leu Val Val He Pro He He Phe Gly He Leu Phe Ala He 
195 200 205 

Leu Leu val Leu Val Phe He Lys Lys Val Ala Lys Lys Pro Thr Asn 
210 215 220 



S ■ I. 



*rrr 



4 

s ' - .4 



104 

Lys Ala Pro His Pro Lys Gin Glu Pro Gin Glu He Asn Phe Pro Asp 
225 230 235 240 

Asp Leu Pro Gly Ser Asn Thr Ala Ala Pro Val Gin Glu Thr Leu His 



245 



2S0 



255 



Gly Cys Gin Pro Val Thr Gin Glu Asp Gly Lys Glu Ser Arg He Scr 
260 265 270 

Val Gin Glu Arg Gin 
275 

(2) INFORMATION FOR SEQ ID NO: 15: 

ii) SEC/JZ NCE CHARACTERISTICS: 

(Ai LENGTH: 255 amino acids 
(Bj TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



; ii ) MOLECULE TYPE: protein 



f 



ixi) SECL"£NCE DESCRIPTION: SEQ ID NO: 15: 

Met Gly Asn Ser Cys Tyr Asn lie Val Ala Thr Leu Leu Leu Val Leu 
15 10 15 

A3-.. Phe Glu Arg Thr Arg Ser Leu Gin Asp Pro Cys Ser Asn Cys Pro 



20 



25 



30 



Ala Gly Thr Phe Cys Asp Asn Asn Arg Asn Gin He Cys Ser Pro Cys 
35 40 45 

Pro Pro Asn Ser Phe Ser Ser Ala Gly Gly "Ir Arg Thr Cyc Asp He 
50 55 6° 

r vs Arc -Tin Cys Lys Gly Val Phe Arg Thr Arg Lys Glu Cys Ser Ser 
65 ~ 70 75 80 

Thr Ser Asn Ala Glu Cys Asp Cys Thr Pro Gly ?h« His. Cys '.eu Gly 
85 95 

Ala G2y Cys Ser Met Cys Glu Gin A: i* ^ys Ly- Gin Gly Gin Glu Leu 
100 105 11° 

Thr Lys Lys Gly Cys Lys Asp Cys Cys Phe Gly Thr Phe Asn Asp Gin 
1*15 120 125 

Lys Arc Gly He Cys Arg Pro Tip Thr Asn Cys Ser Leu Asp Gly Lys 
13 ; 135 140 



1 




Q 



105 

Ser Val Leu Val Asn Gly Thr Lys Glu Arg Asp Val Val Cys Cly Pro 
145 150 1SS 160 

Ser Pro Ala Asp Leu Ser Pro Gly Al* Ser Ser Val Tnr Pro Pro Ala 
165 l^O 175 

Pro Ala Arg Glu Pro Gly His Ser Pro Gin lie He Ser Phe Phe Leu 

185 130 



180 



Ala Leu Thr Ser Thr Ala Leu Leu Phe Leu 



ISfS 



200 



Leu Phe Phe Leu Thr Le*. 
205 



Arg Phe Ser Val Val Ly 

210 215 



ys Arg Gly Arg Lys Lys Leu Leu Tyr He Phe 



220 



Lys Gla Pro Phe Met Arg Pro Val Gin Thr Thr Gin Glu Glu Asp Gly 
230 235 



125 



Cys Ser Cys Arg Phe Pro 



Glu Glu Glu Glu Gly Gly Cys Glu Leu 



245 



250 



255 



i,:^:.^ ^ .... 



(2; INFORMATION FOR Z7Q ID NO : 16 : 

(i) SEQUENCE CHARACTERISTIC?: 

(A LENGTH: 277 amino acids 
^3) TYPE: amino acid 
iC) STRAN3EDNE5S : Single 
iD) TOPOLOGY: linear 

(ii) MCLECULE TYPE: protein 



4- 

% 



V 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Cvs Val Gly Ala Arg Arg Leu Gly Arg Gly Pro Cys Ala Ala Leu 
1 * 5 10 15 

L-u Leu Leu Gly Leu Gly Leu Ser Thr Val Thr Gly Leu His Cys Val 



2S 



30 



Gly Asp Thr Tyr Pro Ser Asn Asp Arg Cys Cys His Glu Cys Arg Pro 
35 ^ 45 

Gly Asn Glv ;/< l Val Ser Arg Cys Ser Arg Ser Gin Asn Thr Val Cys 
50 " 60 

Arg Pro Cys Gly Pre Gly Phe Tyr Asn Asp Val Val -:r Ser Lys Pro 

75 80 



65 



70 



Cys Lys I ro Cys Thr Trp Cys Asr. 



Leu Arg Ser Gly Ser Glu Arg Lys 



85 



90 



95 



tin I 



106 

Leu Cys Thr Ala Thr Gin Asp Thr Val Cys Ar 9 Cy 9 Arg Ala Gly 

100 "5 ll ° 

Gin Pr, Leu Asp Ser Tyr Lys Pro Gly Val Aap Cy» Ala Pro Cy. 



120 i=* 



p.~ Gly His Phe Ser Pro Gly Asp Asn Gin Ala Cys Ly* Pro Trp 
130 135 

Asn Cys Thr Leu Al- Gly Lys His Thr Leu Gin Pro Ala Ser A=n 
150 «* "0 



Ser Asp Ala He Cys Glu Asp Arg Asp Pro Pro Ala Thr Gin Pro 

lie Thr Val Gin Pro Thr 



1" 

Glu Thr Gin Gly Pro Pro Ala Arg Pro 

180 1*5 lW 

Ala Trp Fro Arg Thr Ser Gin Gly Pro Ser Thr Arg Pro Val Glu 
1? c 200 r0S 

Pro _ 

210 =15 "° 

3ly Leu Leu Giy Pro Leu Ala He Leu Leu Ala Leu Tyr Lou Lou 
230 135 

Arc Asp Gin Arg Leu Pro Pro Asp Ala Hid Lyn Pro Pro Gly Gly 
245 =50 



Gly Gly Arg Ala Val Ala Ala lie Leu Gly I.ou Gly Leu Val 



26S 2*0 



Ser Phe Arg Thr Pr, lie Gin Glu Glu Gin Ala A- p Ala His Ser 

260 

Leu Ala Ly--. He 



;rmat : os fc~. seq id no : : 7 : 

, A LZN 3TK 5 -i ? ,ir-. ; :;c a c I d 3 
r. -v- E - i - : r. > 4c:d 

. r 3TrJc;z cir:::-ii ; sir. ; e 

XCLETVLE TV F H : prccein 



se:ve:::e ~ei :f:pt:::: se: :d n::17 : 

Lvs Ser Val Le- Tyr Leu Tyr He Leu Phe Leu Ser Cys tie 





107 



lie Asn Gly Arg Asp Ala Ala Pro Tyr Thr Pro Pro Asn Gly Lys Cys 



20 



25 



Lvs Asp Thr Glu Tyr Lys Arg H.s Asn Leu Cys Cys Leu Ser Cys Pro 
35 *° 45 

Lys Thr Asn Thr Gin 



Pro 



Gly Thr Tyr Ala Ser Arg Leu Cy3 Asp Ser 



50 



55 



60 



Cys Thr Pro Cys Gly Ser Gly Thr Phe 
65 70 

Pro 



Thr Ser Arg Asn Asn His Leu 



75 



80 



Ala Cys Leu Ser Cys Asn Gly Arg Cys Asn Ser Asn Gin Val Glu 



85 



90 



Thr 



Arg Ser Cys Asn Thr Thr His Asn 



Arg He Cys Glu Cys Ser Pro 



100 



X05 



110 



Gly Tyr Tyr Cys Leu Leu Lys Gly Ser Ser 



Gly Cys Lys Ala Cys Val 



115 



120 



125 



Ser Gin Thr Lys Cys Gly lie Gly Ty 



r Gly Val Ser Gly His Thr Ser 



130 



135 



140 



al Gly Asp Val He Cys Ser Pro Cy 



s Gly Phe Gly Thr Tyr Ser His 



145 



150 



155 



160 



Thr Val Ser Ser Ala Asp Lys Cys Glu Pro Val Pro Asn Asn Thr Pne 



Asn Tvr He Asp Val Giu lie Thr Leu 

185 



180 



Tyr Pre Val Asn Asp Thr Ser 
190 



C s Thr Arg Thr Thr Thr Thr Gly Leu Se; 
195 :o ° 



Glu Ser He Leu Thr Ser 
205 



Glu Leu Thr He Thr Me 



.et Asn His Thr Asp Cys Asn Pro Val Phe Arg 



210 



215 



220 



125 



Tvr Phe Ser Val Leu Asn Lys Val Ala Thr Ser Gly Phe Phe 
2 30 =35 



Thr Gly Glu Asn Arg 



Tvr Gin Asn He Ser Lys Val Cys Thr Leu Asn 



:45 



: so 



:55 



Phe Glu He Lys Cys Asn Asn Lys Gly Ser Ser 



260 



265 



Phe lys Gin Leu Thr 
270 



Lvs Ala : ys Asn Asp Asp Gl 

2 - 5 280 



y Met Met Ser His Ser Glu Thr Val Thr 



:85 



Leu Ala Gly Asp Cys Leu Ser Ser Val Asp He Tyr He Leu Tyr Ser 



L3" 



:95 



300 



it * 



V 



110 

GAGTTTGACC AGAGATGCAA GGGGTGAAGG AGCGCTTCCT 
CAGXKCGCCC CGGCCGCCTG ATGGCCG* 3G CAGGGTGCGA 
CGGGAACCAT ACCATG3CCC GGATCCCCAA GACCCTAAAG 
GGTCCTGCTG CCAGTCCTAG CTTACTCTGC CACCACTGCC 
CAGNCANTCG NCCCACAGCA ACAGNGGCAC AGTTTCAAGG 
GTTTTTATAG TTCAG AACNT ATTGGNCCTN TNAACCCTTG 
CAANG7TTCC AANATGNACT TTTTNGTTCC CTGTTANATT 
ATTTNTNAAC CTTNCCNGGG NAAATT 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



ACCGTTAGGA 
CCCAGGACCC 
TTCGTGGTCG 
CGGCAGAGGA 
GGGNAGGAGT 
CACAAGGGTT 
TTTTAATTAG 



ACTCTGGGGA 
AGGACGGCGT 
TCATCGTCGC 
AGTTNCCCAG 
TTTCCANCAA 
TGGNTTAAAC 
TTNAANTTAA 



120 
180 
240 
300 
360 
420 
480 
506 



a* 



V<* < *s 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
GGCAGA3GTC TCTCCAGCCT GGCTCTATCT TCCTCCTTGT NATCGTCCCA TCCCCACATC 
CCGTGCACCC CCCAGGACCC TGGTCTCATC AGTCCCTCTC CTGGAGCTGG GGGTCCACAC 
ATC7CCCAGC CAAGTCCAAG AGGGCAGGGC CAGTTCCTCC CATCTTCAGG CCCAGCCAGG 
CA3GGGGCA3 TCGGCTCCTC AACTGGGTGA CAAGGGTGAG GATGAGAAGT GGTCACGGGG 
A7T7AT7CA3 CCTTGGTCAG AGCAGAACAC AGATTTTTCC GTGTGTTGGT TTTTACTCTO 
NTTCCCCTTC TTNATNCCCC TTTCN 
(2) INFORMATION FOR SEQ ID NO : 2 1 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 base pairs 
:B> TYPE: nucleic acid 
(C) STRANDEDNESS : single 
<0) TOPOLOGY: linear 



60 
12 0 
180 
240 
300 
325 



{ii) MOLECULE TYPE: DNA (genomic) 



• • • • 




C4 



in 



240 
300 
340 



(ii) MOLECULE TYPE : DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CCAGGGTCTC CTNCCCCACC TGCTGAAGAG ACANTGACCA CCAGCCCGGG GACTCCTGCC 
tctt cctcat tacctctnat CNANCATCGT AGGGATCATA GTTCTAATTG TGCCTTCTGA 
ATTGTGCTTT G7TTGGAAAG ACTTCAC1 GT GGGAAGAAAT TCCTTCCTTA CCTGAAGTTG 
CA33TAGGC c 7T3GGTNAGG GCGNGGGGCG CTGGACANTN TCTGGNCCTG GCTGCCCGCT 

-.2 INFORMATION FOR SEQ ID NO : 2 3 : 

;i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 497 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
iD) TOPOLOGY: linear 

;: ; MOLECULE TYPE: DNA (genomic) 



ixi- SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GGCACGAGCA GGGTCCTGTN 7CCGCCCTGA GCCGCGCTCT NCCTGCTCCA GCAAGGACCA 



SI'S 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: *V.^r( 
GGCAGAGGCC CCAGCTGCTG AACAGACAAT AATCACCAGC CCGGGGACTC CTGNNTCTNC 60 
TOATTACCTC TNATGCACCA TCGTAGGGAT CATAGTTCTA ATTGTGCCTT CTAATTGTTT 120 
TTG TTTG AAA AGANTTCACT GTGG AAG AAA TTCCTTCCTT ACCTGTAAGT TNCAGGTAGG 180 
NGCCTGGCTG AGGGCGGGGG GCGCTGGTAC ACTCTCTGAC CCTGCCTCCC T CTG NCTGTT 
TTCCCACAGA CAGAAACGCC TGCCCCTGNC CCCAAGTTCC TNGTGTTTTC CAGCCTGGCT 
C-ATCTTNNC TCCTTGTGAA TCGTTCCCAT CCCCACANGC 
(2) INFORMATION FCR SEg ID NO: 22: 

v • <y j i 

(i) SEQUENCE CHARACTERISTICS: ^ f ^ V ^ 

(A) LENGTH: 241 base pairs f • > 

(B) TYPE: nucleic acid - \ 

(C) STRANDEDNESS : single >'* O 

(D) TOPOLOGY: linear ia^M 
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TGAGGGCGCT GGAGGGGCCA GGCCTCTCGC TGCTGTCCTG GTGTTGGCGC TGCCTGCCCT 120 jj 
GCTGCCGGTG CCGGCTGTAC GCGGAGTGGC AGAAACACNN ACNTACCCCT GGCGGGACGN 

*l 7T* AGAGACAGGG GAGCGGC7GG TGTNTNCCCA NTGCCCCCAG GCACCTTTNT GCAGCGGCCG 240 

P TGCCGSCGAG ACAGCCCCAC GACGTGTGGC CCGTNTCCAC CGCGCCACTA CACGCATTCT 300 

[, CGAACTACCT GGAGCGCTGN CCTTACTNCA ACG7CCTCTG CGGGGAGCGT NAGGAGGAGG 360 

CACGGGTTTN CCACGNCAAC CACAACCCNG GNTTACCGTN GCCGNACCGG TTTCTTCCNG 420 

» " GCAAGTTGGT TTTTNNTTTG GAGNAAGGAT TCGTGTTNCA ATTNATTJAC GNAC-TGATTN 480 



NNCNCGGGAA ACTNAAA 

(2) INFORMATION FOR SfcQ ID NO: 24: 



497 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CGCAACTGCA CGGCCCTGGG ACTGGCCCTC AATGTGCCAG GNTCTTCCTC CCATGACACC 6 0 

CTGT ^ CACCA GCTGCACTGG CTTCCCCCTC AGACCAGGGT ACCANGAGCT GAGGAGTGTG 120 



> ' A3"V-3CCGT CATCGACTTT TTGG CTTTCC AGGACATCTC CATCAAGAGG CTGCAGCGGC 

7 3 C7 C A_\'G C C 
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190 



INFOF-V-ATION FOR SEQ ID NO: 2S: 

(1) SEQUENCE CHARACTER I STICS : 
(A) LENGTH: 2 7 base pairs 
{ B ) TYPE: nucleic acid 
(C) 5TRANDEDNESS : single 
O) TOPOLOGY: linear 

In- MOLECULE TYPE: DNA (genomic) 



'xi i SEQUENCE "ESCF.IPTIGN: SEQ ID NO:2S: 
COtGGATCCA CCATTGCCCG GCAGGAG 



f". 



r (i) SEQUENCE CHARACTERISTICS. . m 

* t^. • ■ ^ (A) LENGTH: 190 base pairs Vc > 

' (3) TYPE ■ nucleic acid 

i { c ) STRANDEDNESS : single 



iZ) TOPOLOGY: linear l'.'^VS 

I . " 

t~ i H) MGLECULE TYPE: DNA (genomic) 

r - - - 



113 



INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3C base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
( 2 ) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 



(xi) SEQUENCE TESCRJPTICN: SEQ ID NO:26 

'CTAGAC TAGTAATGAG A..GAGGCAGG 

INFORMATION FOR SEQ ID NO: 27: 

CI 1 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
',3) TYPE: nucleic acid 
CO STRANDEDNESS : single 
(,D) TOPOLOGY: linear 

tii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2 

CCATGGC AGAAACACCC ACCTAC 

INFORMATION FCR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 
,A) LENGTH: 26 base pairs 
.3) TYPE: nucleic acid 
\Z) STRANDEDNESS: single 
;D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



' X i ) SEQUENCE DESCRIPTION: SEQ ID NO : 2 

AGCTTC TCTTTCAGTG CAAGTG 

INFORMATION FCR SEQ ID NO: 29. 

(i) SEQUENCE CHARACTERISTICS : 
•A) LENGTH: 2 8 base pairs 



* < 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY linear 

(ii) MOLECULE TYPE : DNA (genomic) 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 
CGCAAGCTTC TCCTCAGCTC CTGCAGTG 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



28 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

CGCTCTAGAC CGCCATCATG GCCCGGATCC CCAAG 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
{>. x LENGTH: 36 base pairs 
;3! TYPE: nucleic acid 
;C) STRANDEDNESS: single 
iD) TOPOLOGY: linear 

(ii> MOLECULE TYPE: DNA (genomic) 



35 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

C3CGGATCCG CCATCATGAG GGCGTGGAGG GGCCAG 

: ; ! information foh SEQ ID NO : 32 : 

(1) SEQUENCE CHARACTERISTICS: 
;A) LENGTH: 3 0 base pairs 
C b ) TYPE: nucleic acid 
.:c) STRANDEDNESS : single 
-.D) TOPOLOGY: linear 



36 



V? 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3." 

C TAG AC T AGT AATG AG AAGAGGCAGG 

INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 
iBi TYPE: nucleic acid 

(C) STRAND EDNE3S : single 

(D) TOPOLOGY: linear 



30 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

GTACCC TCTTTCAGTG CAAGTG 

INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



26 



■xi) SEQUENCE DESCRIPTION : SEQ ID NC:34: 

GGTACCC "CTCAGCTC CTGCAGTG 

INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 34 base pairs 
S; TYPE : nucleic acid 
(C) STRANDEDNESS : single 
■D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : DNA (genomic) 



116 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO- 35: 
CGCGAATTCC GCCATCATGG CCCGGATCCC CAAG 
C) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE ; nucleic acid 

(C) strantedness : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



34 
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(xi* SEQUENCE DESCRIPTION: FLQ ID NO:36: 

GCGTCTAGAG TAATGAGAAG AGGCAGG 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 3 5 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
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{xi ) SEQUENCE DESCRIPTION : SEQ ID NO:3">: 
CGCTCTAGAC CGCCATCATG GCCCGGATCC CCAAG 



(2) information FOR SEQ ID NO : 3 3 : 




•:i) SEQUENCE CHARACTERISTICS: 
'.A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDErNESS : single 
iD) TOPOLOGY- linear 

;ii> MOLECULE TYPE : DNA (genomic) 



(xi> SEQUENCE DESCRIPTION : SEQ ID NO: 38: 
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What is Claimed h: 

1 . An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95%. identical to a sequence selected from the group consisting of: 

(a) a nucleotide sequence encoding a TNFR polypeptide having the complete amino 
acid sequence in SEQ ID NO 2. 4 or 6 or as encoded by a cDNA clone contained in ATCC 
Deposit No. 97798, 978 1 0 or 97809; 

(b) a nucleotide sequence encoding a mature TNFR polypeptide having an amino acid 
sequence at positions 2^-259. 31-300 or 31-170 in SIZQ ID NO:2. 4 and 6, respectively, or as 

,...JcJ oy the cDNA clone contained in the ATCC Deposit No. 97798. 97810 or 97809; 

\c) a nucleotide sequence encoding the soluble extracellular domain of a TNFR 
po t ncptidc having the amino acid sequence at posit.ons 27-240, 3 1 -283 or 3 1 - 1 66 of SEQ ID 
NC ^2.4 and 6. respectively; and 

(d) a nucleotide sequence complementary' to any of the nucleotide sequences in (a), 

{b). or (c) above. 

2. The nucleic acid molecule of claim I wherein said polynucleotide has a 
complete nucleotide sequence selected from the group consisting of SIZQ ID Nt . SEQ ID 
NO:3 and SEQ ID NO 5. 

3. The nucleic acid molecule of claim I wherein said poN nucleotide has a 
nucleotide sequence which encodes a TNFR polypeptide having a complete amino acid 
sequence selected from the group consisting of SEQ ID NO:2. SEQ ID NO:4 and SLQ ID 

NO:6. 

4. The nucleic acid molecule of claim I wherein said polynucleotide has a 
nucleotide sequence encoding the mature form of a TNFR polypeptide having an amino acid 
sequence from about 27 to about 259 in SEQ ID NO:2, from about 31 to about 300 in SEQ 
ID NO:4. or from about 31 to about I 70 in SEQ ID NO:6. 



5. The nucleic acid molecule of claim I whercn said polynucleotide has a 
nucleotide sequence encoding the soluble extracellular domain of a TNFR polypeptide having 
the amino acid sequence from about 27 to about 240 in SEQ ID NO:2. from about 3 1 to about 
2S3 in SEQ ID NO:4. or from about 31 to about 166 of SEQ ID NO:6. 

6. An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95% identical to a sequence selected from the group consisting of: 

<a> a nucleotide sequence encoding a polypeptide comprising the amino acid 
sequence of csiducs m-259 of SEQ ID NO:2, where m is an integer in the range of 1-53; 

(b) a nucleotide sequence encoding a polypeptide comprising the amino acid 
sequence of residues n-300 of SEQ ID NO:4. where n is an integer in the range of 1-49; 

(c) a nucleotide sequence encoding a polypeptide comprising the amino acid 
sequence of residues n-170 of SEQ ID NO:6. where n is an integer in the range of 1-49; 

(d) a nucleotide sequence encoding a polypeptide comprising the amino acid 
sequence of residues 1-x of SEQ ID NO:2, where x is an integer in the range of 149-259; 

te) a nucleotide sequence encoding a polypeptide comprising the amino acid 
sequence of residues l-y of SEQ ID NO:4. where y is an integer in the range of 193-300; 

( f) a nucleotide sequence encoding a polypeptide comprising the amino acid 
sequence of residues 1-z of SEQ ID NO:6, where z is an integer in the range of 132-170; and 

(g) a nucleotide sequence encoding a polypeptide having the amino acid sequence 
conning of residues m-x of SEQ ID NO:2. n-y of SEQ ID NO:4. or n-z of SEQ ID NO:6 as 
m. n. x. y and z are defined in (a), (b). (c). (d). (e) and (0 above. 

7. An isolated nucleic acid molecule comprising a polynucleotide having a 
nucleotide sequence at least 95° o identical to a sequence selected from the group consisting of: 

(a) a nucleotide sequence encoding a polypeptide consisting of a portion of a 
complete TNI R amino acid sequence encoded by a cDNA cone contained in ATCC Deposit 
\n. 9~""9X. 97X10 or 9*7X09 wherein said portion excludes from 1 to about 52. from I to 
about 4X and from 1 to about 4X amino acids from the amino terminus of said complete 
aimrui.acid 'sequence encoded by the cDNA clone contained in ATCC Deposit No. 9779X. 
Old and V7MW. respectively; 



(b) a nucleotide sequence encoding a polypeptide consisting of a portion of a 
complete TNFR amino acid sequence encoded by a cDNA clone contained in ATCC Deposit 
No. 9"98. 9~810 or 97S09 wherein said portion excludes from I to about 1 10, from I to 
about 107. or from 1 to about 38 amino acids from the carboxy terminus of said complete 
ammo acid sequence encoded by the cDNA clone contained in ATCC Deposit No. 97798, 
97S 1 0 and 97809. respectively; and 

(c) a nucleotide sequence encoding a polypeptide consisting of a portion of a complete 
TNFR amino acid sequence encoded by the cDNA clone contained in ATCC Deposit No 
97-QS. 97SI0 or 97809. wherein said portion includes a combination of any of the amino 
terminal and carboxy terminal deletions for the respective clones in (a) and (b), above. 

8. The nucleic acid molecule of claim I wherein said polynucleotide has the 
complete nucleotide sequence of the cDNA clone contained in ATCC Deposit No. 97798, 
978l0or97S09. 

9. The nucleic acid molecule of claim 1 wherein said polynucleotide has the 
nucleotide sequence encoding a TNFR polypeptide having the complete amino acid sequence 
encoded by the cDNA clone contained in *TCC Deposit No. 97798. 97810 or 97809. 

10. The nucleic acid molecule of claim I wherein said polynucleotide has the 
nucleotide sequence enc^mu a mature TNFR polypeptide having the amino acid sequence 
encoded by the cDNA clone contained in ATCC Deposit No. 97798, 97810 or 97809. 

M. An isolated nucleic acid molecule comprising a polynucleotide which 
hybridizer under stringent hybridization conditions to a polynucleotide having a nucleotide 
.sequence identical to a nucleotide sequence in (a), (b). <c>. or <d) of claim 1 wherein said 
polynucleotide which hybridize* does not hybridize under stringent hybridization condition 
tn j polynucleotide ha\ mg a nucleotide sequence consisting of only a residues or of only T 



1 2. An isolated nucleic acid molecule com; rising a polynucleotide which encodes 
the amino acid sequence of an epitope-bcaring portion of a TNFR polypeptide having an 
amino acid sequence in (a), fb). (c) or (d) o*" claim I. 

1 3. The isolated nucleic acid molecule of claim 1 2, which encodes an 
epitope-bcaring portion of a TNFR polypeptide comprising amino acid residues selected from 
the group consisting of: from about Gln-42 to about Glu-52 in SEQ ID NO:2, from about 
His-5S to about Cys-66 in SEQ ID NO:2, from about Pro-68 to about Thr-76 in SEQ ID 

NO l from about Ser-79 to about Cys-85 in SEQ ID NO:2. from about Cys-9l to about Thr- 
102 in SEQ ID NO:2. from about Gln-1 10 to aoout Pro-122 in SEQ ID NO:2, from about 
Arg-126 to about Val-136 in SEQ ID NO:2, from about Thr-142 to about GIu-148 in SEQ ID 
NO:2. from about Ala-3 1 to about Thr-46 in SEQ ID NO:4, from about Phe-57 to about 
Thr-1 17 in SEQ ID NO:4. from about Cys-132 to about Thr-175 in SEQ ID NO:4, from 
about Gly-185 to about Thr-194 in SEQ ID NO:4, from about Val-205 to about Asp-217 in 
SEQ ID NO:4. from about P:o-239 to about Leu-264 in SEQ ID NO:4, and from about Ala- 
283 to about Pro-298 in SEQ ID NO:4. from about Ala-3 1 to about Thr-46 in SEQ ID NO:6, 
from about Phe-57 to about GIn-80 in SEQ ID NO:6, from about Glu-86 to about His-106 in 
SEQ ID NO:6. from about Thr-1 OS to about Phc-l 19 in SEQ ID NO:6. from about His- 129 
to about YaM3S in SEQ ID NO:6. and from about Gly-142 to about Pro-166 in SEQ ID 
NO h to about in SEQ ID NO:6 

14 A method tor making a recombinant vector comprising inserting an isolated 
nucleic acid molecule of claim 1 into a vector. 

1 5. A recombinant vector produced by the method of claim 14. 

lf> A method of making a recombinant hoM cell comprising introducing the 
rcLomhinant \ cctor of claim 15 into a hoM cell. 

I 7. A recombinant host cell produced by the method of claim 16. 



1 8. A recombinant method for producing a TNFR polypeptide, comprising 
cultunnc the recombinant host cell of claim 17 under conditions such that said polypeptide is 
expressed and recovering said polypeptide. 

1 Q. An isolated TNFR polypeptide comprising an amino acid sequence at least 
95° o identical to a sequence selected from the group consisting of: 

(a) the amino acid sequence of a full-length TNFR polypeptide having the 
complete amino acid sequence shown in SEQ ID NO:2, 4 or 6, or as encoded by a cDNA 
clone contained in ATCC Deposit No. 97798, 97810 or 97809; 

(b) the amino acid sequence of a mature TNFR polypeptide having the amino acid 
sequence at positions 27-259 in SEQ ID NO:2, 31-300 in SEQ ID NO:4, or 31-170 in SEQ 
ID NO:6. or as encoded by a cDNA clone contained in ATCC Deposit No. 97798, 97810 or 
9"S09; or 

(c) the amino acid sequence of a soluble extracellular domain ofc TNFR 
polypeptide having the amino acid sequence at positions 27 to 240 in SEQ ID NO:2, 31 to 
283 in SEQ ID NO:4, or 3 1 to 1 66 in SEQ ID NO:6, or as encoded by the cDNA clone 
contained in ATCC Deposit No. 97798. 97810 or 97809. 



I « 




20. An isolated polypeptide comprising an cpitope-bcaring portion of the TNFR 
protein, wherein said portion is selected from the group consisting of a polypeptide 
comprising amino acid residues: from about Gln-42 to about Glu-52 in SEQ ID NO:2, from 
about His-58 to about Cys-66 in SEQ ID NO:2, from about Pro-68 to about Thr-76 in SEQ 
ID NO:2. from about Ser-79 to about Cys-85 in SEQ ID NO:2, from about Cys-91 to about 
Thr-102 in SEQ ID NO:2. frcm about Gln-110 to about Pro-122 in SEQ ID NO:2, from 
about Arg-126 to about VaI-136 in SEQ ID NO:2, from about Thr-142 to about GIu-148 in 
SEQ ID NO:2. from about Ala-3 1 to about Thr-46 in SEQ ID NO:4, from about Phe-57 to 
about Thr-117 in SEQ ID NO:4. from about Cys-132 to about Thr-175 in SEQ ID NO:4, 
from about Gly-185 to about Thr-194 in SEQ ID NO:4. from about Val-205 io about Asp- 
217 in SEQ ID NO:4, from about Pro-239 to about Leu-264 in SEQ ID NO:4. and from about 
Ala-283 to about Pro-298 in SEQ ID NO:4, from about Ala-3 1 to about Thr-46 in SEQ ID 
NO:6. from about Phe-57 to about Gln-80 in SEQ ID NO:6. from about GIu-86 to about His- 
106 in SEQ ID NO:6. from about Thr-108 to about Phe-1 19 in SEQ ID NO:6. from about 
His-129 to about Val~13K in SEQ ID NO:6, and from about Gly-142 to about Pro-166 in SEQ 
ID NO:6 to about in SEQ ID NO:6. 
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21. An isolated antibody that bine's specifically to a TNFR polypeptide of claim 



22. An isolated p.ucleic acid molecule comprising a polynucleotide having a 
jquence at lea>t 95° o identical to a sequence selected from the group consisting of: 
<a> the nucleotide sequence of clone IIPRCB54R (SEQ ID NO: 19); 
<b> the nucleotide sequence of clone HSJAU57RA (SEQ ID NO:20); 
<c) the nucleotide sequence of clone HELBP70R (SEQ ID NO:21): 
id) the nucleotide sequence of clone IIUSCB54R (SEQ ID NO:22); 
(e) the nucleotide sequence of clone IIELDI06R (SEQ ID NO:23): 
if) the nucleotide *cquencc of clone I K hOW3NR (SIQ ID NO:24): and 

f y > .a nucleotide sequence complementary to any of the nucleotide sequences in 



(;m. <b). <c>. (d). (c). or ( fi above. 



Abstract 

The present invention relates to novel Tumor Necorsis Factor Receptor proteins. In 
particular, isolated nucleic acid molecules are provided encoding the human TNFR-5. -6a & - 
6{5 proteins. TNFR-5. -6a & -6(J polypeptides are also provided as arc vectors, host cells 
and recombinant methods for producing the same. The invention further relates to screening 
methods for identifying agonists and antagonists of TNFR-5. -6a & -6P activity. Also 
provided are diagnostic methods for detecting immune system-related disorders and 
therapeutic methods for treating immune system-related disorders. 



Figure 1 

CCTCTCCACGCGCACGAACTCAGCCAACGATTTCTGATAGATTTTTGGGAGTTTGACCAG 

AGATGCAAGGGGTGAAGGAGCGCTTCCTACCGTTAGGGAACTCTGGGGACAGAGCGCCCC 

GGCCGCCTGATGGCCGAGGCAGGGTGCGACCCAGGACCCAGGACGGCGTCGGGAACCATA 

CCATGG- :r CGG ATCCCC AAG ACCCTAAAGTTCGTCGTCGTC ATCGTCGCGGTC CTGCTGC 
— AR T PKTLKFVV Y I V A ¥ L J2 

CAGTCCTAGCTTACTCTGCCACCACTGCCCGGCAGGAGGAAGTTCCCCAGCAGACAGTGG 
V L A Y S A TTARQEEVPQQTVA 

CCCCACAGCAACAGAGGCACAGCTTCAAGGGGGAGGAGTGTCCAGCAGGATCTCATAGAT 
POQQRHSFKGEECPAGSHRS 

CAGAACATACTGGAGCCTCTAACCCGTGCACAGAGGGTGTGGATTACACCAACGCTTCCA 
EHTGACNPCTEGVDYTtlASN 
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ACAATGAACCTTCTTGCTTCCCATGTACAGTTTGTAAATCAGATCAAAAACA1AAAAGTT 
NEPSCFPCTVCKSOQKH-KSS 

CCTGCACCATGACCAGAGACACAGTGTGTCAGTGTAAAGAAGGCACCTTCCGGAATGAAA 
CTMTRDTVCQCKEGT FRNEN 

ACTCCCCAGAGA7GTGCCGGAAGTGTAGCAGGTGCCCTAGTGGGGAAGTCCAAG7CAGTA 
SPEMCRKCSRCPSGEVQVSN 

ATTGTACGTCCTGGGATGATATCCAGTGTGTTGAAGAATTTGGTGCCAATGCCACTGTGG 
CTSWPDIQCVEEFGANATVE 

AAACC C C AGCTGCTG AAG AG AC AATGAAC AC C AGCCCGGGG ACTCC TGCCCC AGCTGCTG 
T PAAEETMNTSPGTPAPAAE 

AAG AG AC AA TG A.=iC ACC AGCC CAGGG ACTCC TGCCCC AGCTGCTG AAG AG AC AATG AC C A 
ETMNTS PGTPAPAAEETMTT 

CCAGCCCGCGGACTCCTGCCCCAGCTGCTGAAGAGACAATGACCACCAGCCCGGGGACTC 
S PGTPAPAAEETMTTS PGTP 



CTGCCCCAGCTGCTGAAGAGACAATGACCACCAGCCCGGGGACTCCTGCCTCTTCTCATT 
APAAEETMTTSPGTPASSHY 
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Figure 1 (continued) 



ACCTCTCATGCACCATCGTAGGGATCATAGTTCTAATTGTGCTTCTGATTGTGTTTGTTT 
LSCTIVGIIVLIVLLIVFV* 

GAAAGACTTCACTGTGGAAGAAATTCCTTCCTTACCTGAAAGGTTCAGGTAGGCGCTGGC 

TGAGGGCGGC3GGCGCTGGACACTCTCTGCCCTGCCTCCCTCTGCTGTGTTCCCACAGAC 

AGAAACGCC7GCCCCTGCCCCAAGTCCTGGTG7C7CCAGCCTGGC7CTATCTTCCTCCTT 

G7GA7CG7CCCA7CCCCACA7CCCG7GCACCCCCCAGGACCCTGG7C7CA7CAG7CCC7C 

7CCTGGAGC7GGGGG7CCACACATCTCCCAGCCAAG7CC/AGAGGCAGGGCCAGTTCCTC 

CCA7C77CAGGCCCAGCCAGGCAGGGGGCAG7CGGC7CC7CAACTGGG7GACAAGGGTGA 

GGA7GAGAAG7GG7CACGGGA77TA77CAGCCTTGG7CAGAGCAGAACACAGAGATTTTC 

CG7GAAAAAAAA 
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GC7C7CCC7C5CTCCAGCAAGGACCATGAGGGCGC7GGAGGGGCCAGGCCTG7CGC7GCTG 

M B A L EGP GLSLL 



TGCCTGGTGTTGGCGCTGCCTGCCCTGCTGCCGGTGCCGGCTGTACGCGGAGTGGCAGAA 
C_I ii L & L E ALLPVPAVRGV A E 

ACACCCACC ACCCC7GGCGGGACCCAGAGACAGGGGAGCGGCTGG7G7GCGCCCAG7GC 
T P7YPW RDA E7GERLVCA QC 

CCrCCAGGCACCTTTGTGCAGCGGCCGTGCCGCCGAGACAGCCCCACGACGTGTGGCCCG 
P PGTFVQRPCRRDSPTTCGP 

7G7CCACCSCGCCAC7ACACGCAG77CTGGAAC7ACC7CGAGCGCTGCCGC7AC7GCAAC 
CPPRHY7QFWNYLERCRYCN 

G7CC7C7GCGGGGAGCG7GAGGAGGAGGCACGGGCT7GCCACGCCACCCACAACCG7GCC 
VLCGEREEEARACHATHNRA 

7GCCGC7GCCGCACCGGC7TC7TCGCGCACGC7GG77TC7GC77G GAGCACGCA7CG7GT 
CRCR7GFFAHAGFCLEHASC 

CCACC7GG7GCCGGCG7GA77GCCCCGGGCACCCCCAGCCAGAACACGCAG7GCCAGCCG 
P PGAGVIA PGTPSQNTQCQP 

7GCCCCCCAGGCACC77C7CAGCCAGCAGC7CCAGC7CAGAGCAG7GCCAGCCCCACCGC 
c PPGTFSA3SSSSEQCQPHR 

AACTGCACGGCCCTGGGCCTGGCCCTCAATGTGCCAGGCTCTTCCTCCCATGACACCCTG 
NC7ALGLALNVPGSSSHDTL 

73 r ACC AGCTGC AC7GGC77CCCCC7C AGC ACC AGGG " ACC AGGAGCTG AGG AGTGTG AG 
C7SCTGFPLSTRVPGAEECE 

CG7G7CG7CA7CGAC777G7GGC777CCAGGACA7CTCCA7CAAGAGGC7GCAGCGGC7G 
rAVIDFVAFQDISIKRLQRL 

C7G7AGGCCC7CGAGGCCCCGGAGGGC7C-GGGTCCGACACCAAGGGCGGGCCGCGCGGCC 
-QALEAPEGWGP7PRAGRAA 

T70CAGC7GAAGC7GCG7CGGCGGC7CACGGAGC7CC7GGGGGCGCAGGACGGGGCGC7G 
-W-KLRRRLTELLGAQDGAL 

C7GG7GCGGC7GC7GCAGGCGC7GCGCC7GGCCAGGA7GCCCGGGC7GGAGCGGAGCG7C 
-VP. LLQALRVARMPCLERSV 

■1'.:7',AGCGC77CC7CCC737GCAC7GA7CC7GGCCCCC7C77A77TA77C7ACA7CC77G 
r E ?. r L P V H • 

0~A^^A77T0CAC7GAAACA0GC77T77777AAATAGAAGAAA7GACG77TC77AAAG 
C7^7A77777A7AAAGC7T777CA7AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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' ATG7 CGG" C AGGC AC AGC AGCG7CCTGTGTCCGCGC7G AGCCGCGCTCTCCCTGCT 

CCAGCAAGGACCATGAGGGCGCTGGAGGGGCCAGGCCTGTCGCTGCT'GTGCCTGGTGTTG 
M — B — A — L E G P G L S L l r t. v r. 

GCGC7;JCC73CCC7GC7GCCGG7GCCGGCTGTACGCGGAG7GCCACAAACACCCACC7AC 
a - H & 1- L E 3£ 2 A V ft G V A E T P T Y 

CCC7GGCGG3ACGCAGAGACAGGGGAGCGGC7GG7G7GCGCCCAG7GCCCCCCAGGCACC 
PWRUhETGERLVCAQCPPGT 

777G7GCAGCGGCCS7GCCGCCGAGACAGCCCCACGACG7G7GGCCCG7G7CCACCGCGC 
FVQRPCRRDSPTTCGPCPPR 

CAC7AC A3GCA377C7r.3AAC7ACC7GCAGCGC7GCCGC7AC7GCAACG7CC7C7GCGGG 
KY7QFWNYLERCRYCNVLCG 

GAGCG7GAGGAGGAGGCACGGGC77GCCACGCCACCCACAACCG7GCC7GCCCC7GCCCC 
HP. EEEARACHATHNRACRCR 

ACCGGC77777CGCGCACGC7GG777C7GC77GGAGCACGCA7CG7G7CCACC7GG7CCC 
7GFFAHACFCLEHASCPPGA 

G3rG7GA77GCCCCGGG7GAGAGCTGGGCGAGGGGAGGGGCCCCCAGGAGTGG7CGCCGG 
"V1APGESWARGGAPRSGGP. 

AGC7G7GGCAGGGCTCAGG77GC7GG7CCCAGCC rTGCACCCTGAGCTAGGACACCAGTT 
r CGRGQVAGPSLAP* 

CC"73A37C7377C77CCC7CC7GGC7GCAGGCACCCCCAGCCAGAACACGCAG7GCCA 
'.. -7-7 COZC 73 CAGGC A CC 7 7C7C AGC C AGC AGC7CCAGC7CAGAGC AG 7GCCAGCC CCA 
3CC3"-GGCC7GGCCC7CAA7G7GCCAGGC7C7TCC7CCCA7GACAC 
3CAC7GCC77CCCCC7CAGCACCAGGG7ACCAGG7GAGCCAGAGGC 
- -~ 3 - - J ~AG'*ACAC7G7AG3CCAGGCCCAC77G7GC7C7CA'.7CC7GCCCC7GCACG 
7G7A.777A3CC7GA3GCA7 3CCAGC7GGC7C7GGGAAGGGGCCACA37GGA777GAGGG3 
. -AGGV'-7CCC77CAC7A^A7CCCCACCAAG7C7GCCC7C7CAGGGG7GGC7GAGAA7 77 
GGA7C7GA3 77A3COCArAGCC7CCCC7GGAGAGC7C7GGGAAAG7GGGCAGCAA7C7CC 
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Figure 3 (continued) 

TAACTGCCCGAGGGCAAGC7GGCTGGCTCCTCTGACACGGGGAAACCGAGGCCTGATGGT 
AACTCTCCTAACTGCCTGAGAGGAAGGTGGCTGCCTCCTCTGACATGGGGAAACCGAGGC 
CCAA7G77AACCAC7G77GAGAAG7CACAGGGGCAAG7GACCCCCT7AACATCAAG7CAG 
GTCCGGTCCATCTGCAGGTCCCAACTCGCCCCTTCCGATGGCCCAGGAGCCCCAAGCCCT 
7GCC7GGGCCCCC77GCC7C77GCAGCCAAGG7CCGAG7GGCCGCTCC7GCCCCC7AGGC 
CTTTGCTCCAGCTCTCTGACCGAAGGCTCCTGCCCCTTCTCCAGTCCCCATCGTTGCACT 
GCCC7C7CCAGCACGGC7CACTGCACAGGGATT7C7C7C7CCTGCAAACCCCCCGAG7GG 
G Z : - CAGAAAGCAGGG7ACC7GGCAGCCCCCGCCAG7G7GTGTGGGTGAAA7GA7CGGAC 
CGC "'JCC7CCCCACCCCAC7GCAGGAGC7GAGGAG7G7GAGCG7GCCG7CA7CGAC7T7G 
7C~r777CCAGGACA7C7CCATCAAGAGGAGCGGC7GCTGCACGCCC 
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